Apple Neural Engine LLMs
Collection
CoreML LLMs optimized for Apple Neural Engine.
•
3 items
•
Updated
•
1
CoreML conversion of Llama-3.2-3B-Instruct with a 512 context length. Optimized for Apple Neural Engine.
Use this CLI to download and run inference. macOS 14 (Sonoma) is required.
This model will likley run slowly or not at all on M1 Macs and phones. Consider trying the 1B model for those devices: smpanaro/Llama-3.2-1B-Instruct-CoreML
Base model
meta-llama/Llama-3.2-3B-Instruct