Edit model card

Note: This is our first public release on Hugging Face, and the Model Card is still a work in progress. Further improvements and updates will follow.

CreativeWorksAi + NeuraLake: Designed by Earth's Creatives, Assembled by AI

Model Description

The iSA-01-Mini-3B-GGUF is a small yet advanced language model developed by CreativeWorksAi, designed to enhance text generation and reasoning capabilities. It extends the context window from 128K to 256K tokens, effectively doubling its information retention and significantly improving performance compared to its base model, meta-llama/Llama-3.2-3B-Instruct.

Hardware Requirements Estimate

Name Quant method Size Memory (RAM, vRAM) required
iSA-01-Mini-3B.F16.gguf F16 6.43 GB 12.86 GB
iSA-01-Mini-3B.Q4_K_M.gguf Q4_K_M 2.02 GB 4.04 GB
iSA-01-Mini-3B.Q5_K_M.gguf Q5_K_M 2.32 GB 4.64 GB

Key Features

  • Extended Context Window: The model's context window has been expanded from 128K to 256K tokens, enabling it to retain more information for better reasoning and logical deductions.
  • Enhanced Reasoning: The increased context size leads to superior performance in complex tasks like Retrieval-Augmented Generation (RAG), resulting in more precise and context-aware outputs.
  • Improved Information Integration: With a larger context window, the model integrates external information more effectively, producing accurate and contextually relevant responses.
  • Fine-tuned with NeuraLake/Megalodon: The model was fine-tuned using synthetic data generated by the state-of-the-art NeuraLake/Megalodon, enhancing its ability to process and analyze complex scenarios.
  • NeuraLake/Megalodon Model: This proprietary, closed-source LLM has been developed by NeuraLake over the past three years to enhance reasoning capabilities, especially for small models and agents.

Training Data

The iSA-01-Mini-3B-GGUF was trained using synthetic data generated by NeuraLake/Megalodon, focused on realistic scenarios to improve reasoning and performance in RAG tasks.

Model Details

Usage

CreativeWorksAi's Intelligence System for Advanced Dialogue and Organized Responses Assistance (i.S.A.D.O.R.A. architecture) is designed to offer users a sophisticated tool for generating coherent, contextually rich text, making it ideal for applications that require advanced natural language understanding and generation.

πŸ‡§πŸ‡·

Downloads last month
146
GGUF
Model size
3.21B params
Architecture
llama

4-bit

5-bit

16-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for CreativeWorksAi/iSA-01-Mini-3B-GGUF

Quantized
(138)
this model

Collection including CreativeWorksAi/iSA-01-Mini-3B-GGUF