🦾💪🏽 Human Feedback Collector | Meta-Llama-3.1-8B-Instruct | (DPO) LLM, chatbot, human-feedback