wing lian's picture

wing lian PRO

winglian

·

AI & ML interests

None yet

Organizations

winglian's activity

New activity in axolotl-ai-co/romulus-mistral-nemo-12b-simpo about 2 months ago

Update README.md

#2 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-Prover-V1.5-Base 3 months ago

the config class and config.json uses DeepseekConfig, not v2

#5 opened 3 months ago by

Match the config class name to what the modeling code expects

#4 opened 3 months ago by

New activity in microsoft/Phi-3.5-mini-instruct 3 months ago

trust_remote_code=True

#9 opened 3 months ago by

New activity in NousResearch/Hermes-2-Pro-Llama-3-8B 7 months ago

add axolotl tag

#1 opened 7 months ago by

New activity in mattshumer/Llama-3-8B-16K 7 months ago

add axolotl tag

#3 opened 7 months ago by

New activity in cognitivecomputations/dolphin-2.9-llama3-8b 7 months ago

add axolotl tag

#12 opened 7 months ago by

New activity in openbmb/Eurus-RM-7b 7 months ago

Enable flash_attention_2 support since the underlying Mistral model supports it

#3 opened 7 months ago by

New activity in meta-llama/Meta-Llama-3-8B 7 months ago

Rename original/tokenizer.model to tokenizer.model

#6 opened 7 months ago by

commented a paper 8 months ago

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 57 •

New activity in PrunaAI/dbrx-base-bnb-4bit 8 months ago

invalid weights doesn't match modeling code

#3 opened 8 months ago by

New activity in SinclairSchneider/dbrx-base-quantization-fixed 8 months ago

reduce verbosity of logging

#1 opened 8 months ago by

New activity in databricks/dbrx-instruct 8 months ago

The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA

#10 opened 8 months ago by

New activity in LnL-AI/dbrx-base-converted-v2 8 months ago

reduce logging verbosity

#3 opened 8 months ago by

New activity in SinclairSchneider/dbrx-instruct-quantization-fixed 8 months ago

dbrx-base

#2 opened 8 months ago by

New activity in ai21labs/Jamba-v0.1 8 months ago

finetuning issues

#9 opened 8 months ago by

Fix bias logic to enable QLoRA finetuning

#5 opened 8 months ago by

New activity in cerebras/SlimPajama-627B 11 months ago

Trouble with streaming

#5 opened over 1 year ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 12 months ago

latest commit breaks ability to submit mistral finetunes

#410 opened 12 months ago by

New activity in Open-Orca/Mistral-7B-OpenOrca 12 months ago

Can you share the training configuration of Axolotl?

#24 opened 12 months ago by