Prithiv Sakthi PRO

prithivMLmods

AI & ML interests

Computer Vision - AI/ML

Articles

Organizations

prithivMLmods's activity

reacted to pagezyhf's post with 👍 about 3 hours ago
view post
Post
317
Hello Hugging Face Community,

I'd like to share here a bit more about our Deep Learning Containers (DLCs) we built with Google Cloud, to transform the way you build AI with open models on this platform!

With pre-configured, optimized environments for PyTorch Training (GPU) and Inference (CPU/GPU), Text Generation Inference (GPU), and Text Embeddings Inference (CPU/GPU), the Hugging Face DLCs offer:

⚡ Optimized performance on Google Cloud's infrastructure, with TGI, TEI, and PyTorch acceleration.
🛠️ Hassle-free environment setup, no more dependency issues.
🔄 Seamless updates to the latest stable versions.
💼 Streamlined workflow, reducing dev and maintenance overheads.
🔒 Robust security features of Google Cloud.
☁️ Fine-tuned for optimal performance, integrated with GKE and Vertex AI.
📦 Community examples for easy experimentation and implementation.
🔜 TPU support for PyTorch Training/Inference and Text Generation Inference is coming soon!

Find the documentation at https://huggingface.co/docs/google-cloud/en/index
If you need support, open a conversation on the forum: https://discuss.huggingface.co/c/google-cloud/69
reacted to LukeNeumann's post with 🔥 about 13 hours ago
view post
Post
636
Hello Hugging Face community!

I wanted to introduce myself and my company @Overlaiapp . We are a collective of filmmakers, photographers, and AI engineers working on high resolution (8K+) training data.

We plan to share a lot of our datasets with the community and are kicking things off with two curated datasets:

- Overlaiai/OregonCoastin4K

- Overlaiai/SubArcticPolarBear


Overlai.ai Dataset Features

🎥 Oversampled: Every clip is captured in stunning 8K resolution, delivering rich detail ideal for fine tuning scenic landscapes and ocean dynamics.

📸 Variance: Includes close-up details, slow-motion footage of crashing waves, sweeping landscapes, and wildlife shots.

📋 Detailed Metadata: Every clip is paired with structured metadata, including creative descriptions, precise camera movements, lens information, field of view calculations, and shot settings, ensuring AI models can fully understand and replicate real-world cinematography with accuracy.

⚙️ Consistency: Re-thinking training data at the point of capture by "overshooting" a subject, enabling models to learn more nuanced relationships and views across scenes.

🌅 Light: Shot during early morning and sunset light for optimal color contrast and dynamic range, maximizing visual quality for color and lighting-sensitive tasks.

🔍 Curation: Curated specifically for machine learning, providing clean, high-quality data for next generation model training.
posted an update about 24 hours ago
reacted to AdinaY's post with 🔥 1 day ago
view post
Post
1345
Let’s dive into the exciting releases from the Chinese community last week 🔥🚀
More details 👉 https://huggingface.co/zh-ai-community

Code model:
✨Qwen 2.5 coder by Alibaba Qwen
Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f
✨OpenCoder by InflyAI - Fully open code model🙌
infly/opencoder-672cec44bbb86c39910fb55e

Image model:
✨Hunyuan3D-1.0 by Tencent
tencent/Hunyuan3D-1

MLLM:
✨JanusFlow by DeepSeek
deepseek-ai/JanusFlow-1.3B
deepseek-ai/JanusFlow-1.3B
✨Mono-InternVL-2B by OpenGVlab
OpenGVLab/Mono-InternVL-2B

Video model:
✨CogVideoX 1.5 by ChatGLM
THUDM/CogVideoX1.5-5B-SAT

Audio model:
✨Fish Agent by FishAudio
fishaudio/fish-agent-v0.1-3b

Dataset:
✨OPI dataset by BAAIBeijing
BAAI/OPI
reacted to merve's post with 🤗 2 days ago
view post
Post
1492
Microsoft released LLM2CLIP: a CLIP model with longer context window for complex text inputs 🤯
All models with Apache 2.0 license here microsoft/llm2clip-672323a266173cfa40b32d4c

TLDR; they replaced CLIP's text encoder with various LLMs fine-tuned on captioning, better top-k accuracy on retrieval.
This will enable better image-text retrieval, better zero-shot image classification, better vision language models 🔥
Read the paper to learn more: LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation (2411.04997)
reacted to louisbrulenaudet's post with 🤗 2 days ago
view post
Post
1484
I’ve published a new to simplify model merging 🤗

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai’s mergekit, streamlining the automation of high-performance merge searches 📖

Dataset : louisbrulenaudet/mergekit-configs
reacted to BlinkDL's post with 🔥 3 days ago
view post
Post
2331
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
posted an update 3 days ago
view post
Post
3063
reacted to ariG23498's post with 🚀 4 days ago
posted an update 4 days ago
view post
Post
3394
reacted to Dref360's post with 🤝 4 days ago
reacted to their post with 🔥 5 days ago
view post
Post
3954
GRID-6X : Layout for Seamless Image Assembly 🔥

🪨Demo: prithivMLmods/GRID-6X
🪨Doc / Blog: https://huggingface.co/blog/prithivMLmods/grid-6x

In the infer function:
grid_img = Image.new('RGB', (width * grid_size_x, height * grid_size_y))
for i, img in enumerate(result.images[:num_images]):
    grid_img.paste(img, (i % grid_size_x * width, i // grid_size_x * height))

1. **Image Initialization**: grid_img is a blank canvas that will hold the images in a grid format.
2. **Image Placement**: Images are pasted onto the canvas using a loop:
- **Horizontal Position**: (i % grid_size_x) * width calculates the x-coordinate.
- **Vertical Position**: (i // grid_size_x) * height calculates the y-coordinate.

1. **Grid Size Selection**: The user selects the grid size from options like "2x1", "1x2", "2x2", "2x3", "3x2", and "1x1". Each option corresponds to the arrangement of images:
- **2x1**: 2 images in a single row
- **1x2**: 1 image in two rows (column layout)
- **2x2**: 2 rows with 2 images each
- **2x3**: 2 rows with 3 images each
- **3x2**: 3 rows with 2 images each
- **1x1**: A single image (default)

2. **Image Generation**: Based on the grid size selection, the app calculates the number of images to generate. For example:
- If the grid size is "2x2", the app generates 4 images.
- For "3x2", it generates 6 images.

-> Each option arranges images accordingly, providing flexibility in viewing multiple images in one output.

-> Added both of these spaces that support the GRID functionality Layout for Seamless Image Assembly :

----------
🔥IMAGINEO-4K: prithivMLmods/IMAGINEO-4K

🔥GRID-6X: prithivMLmods/GRID-6X
----------
.
.
.@prithivMLmods 🤗
reacted to their post with ❤️ 5 days ago
view post
Post
4822
Style flo : : 🎉🤗

{ Try Now on Flux LoRA DLC ⛵ } : prithivMLmods/FLUX-LoRA-DLC

-- Undersea
{ Red Fluid } : prithivMLmods/Red-Undersea-Flux-LoRA

-- 3D Realmix
{ 3D Portrait Render } : prithivMLmods/3D-Render-Flux-LoRA

-- Pop
{ Yellow Pop } : prithivMLmods/Yellow-Pop-Flux-Dev-LoRA

-- Grid
{ Purple Grid } : prithivMLmods/Purple-Grid-Flux-LoRA

{ collections : : }

🚀 Flux LoRA :
prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🚀Collection zero: prithivMLmods/collection-zero-and-demo-recently-updated-65e48a7dd8212873836ceca2


.
.
@prithivMLmods 🧨
posted an update 5 days ago
view post
Post
3954
GRID-6X : Layout for Seamless Image Assembly 🔥

🪨Demo: prithivMLmods/GRID-6X
🪨Doc / Blog: https://huggingface.co/blog/prithivMLmods/grid-6x

In the infer function:
grid_img = Image.new('RGB', (width * grid_size_x, height * grid_size_y))
for i, img in enumerate(result.images[:num_images]):
    grid_img.paste(img, (i % grid_size_x * width, i // grid_size_x * height))

1. **Image Initialization**: grid_img is a blank canvas that will hold the images in a grid format.
2. **Image Placement**: Images are pasted onto the canvas using a loop:
- **Horizontal Position**: (i % grid_size_x) * width calculates the x-coordinate.
- **Vertical Position**: (i // grid_size_x) * height calculates the y-coordinate.

1. **Grid Size Selection**: The user selects the grid size from options like "2x1", "1x2", "2x2", "2x3", "3x2", and "1x1". Each option corresponds to the arrangement of images:
- **2x1**: 2 images in a single row
- **1x2**: 1 image in two rows (column layout)
- **2x2**: 2 rows with 2 images each
- **2x3**: 2 rows with 3 images each
- **3x2**: 3 rows with 2 images each
- **1x1**: A single image (default)

2. **Image Generation**: Based on the grid size selection, the app calculates the number of images to generate. For example:
- If the grid size is "2x2", the app generates 4 images.
- For "3x2", it generates 6 images.

-> Each option arranges images accordingly, providing flexibility in viewing multiple images in one output.

-> Added both of these spaces that support the GRID functionality Layout for Seamless Image Assembly :

----------
🔥IMAGINEO-4K: prithivMLmods/IMAGINEO-4K

🔥GRID-6X: prithivMLmods/GRID-6X
----------
.
.
.@prithivMLmods 🤗
reacted to their post with 👀👍🔥 6 days ago
view post
Post
5528
posted an update 6 days ago
view post
Post
4822
Style flo : : 🎉🤗

{ Try Now on Flux LoRA DLC ⛵ } : prithivMLmods/FLUX-LoRA-DLC

-- Undersea
{ Red Fluid } : prithivMLmods/Red-Undersea-Flux-LoRA

-- 3D Realmix
{ 3D Portrait Render } : prithivMLmods/3D-Render-Flux-LoRA

-- Pop
{ Yellow Pop } : prithivMLmods/Yellow-Pop-Flux-Dev-LoRA

-- Grid
{ Purple Grid } : prithivMLmods/Purple-Grid-Flux-LoRA

{ collections : : }

🚀 Flux LoRA :
prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🚀Collection zero: prithivMLmods/collection-zero-and-demo-recently-updated-65e48a7dd8212873836ceca2


.
.
@prithivMLmods 🧨
reacted to their post with ❤️ 7 days ago
view post
Post
3074
FLUX 1.1 [pro] Ultra : API - { 4x Higher Image Resolutions }
Upto - 4 Megapixels, 10 seconds per sample. { Hi - Res }

{ Blog Post ⛵ } : https://huggingface.co/blog/prithivMLmods/flux-pro-endpoint

Endpoint Creation Step by Step: 🧵
-> Sign up to { api.bfl.ml } & get your api's: https://api.bfl.ml/auth/profile
-> File Structure:
flux_image_generation/
├── .env
├── generate_image.py
└── requirements.txt

-> Step 0: Add Your API Key to an Environment File
{ .env }
BFL_API_KEY=your_actual_api_key_here

-> Step 1: Install Required Libraries
{ requirements.txt }
requests
python-dotenv

-> Step 2: Setup the Python Script
{ generate_image.py} - https://github.com/PRITHIVSAKTHIUR/Flux-API/blob/main/generate_image.py

-> Step3: Install the requirements & Run the Script
pip install -r requirements.txt

python generate_image.py

-> Polling: The script polls the API every 0.5 seconds until the image generation result is ready. That's it the script also checks for a successful response after submitting the request.

For more visit:
🔺for script: https://github.com/PRITHIVSAKTHIUR/Flux-API/tree/main
🔺bfl doc: https://docs.bfl.ml/quick_start/gen_image/#__tabbed_1_2

Endpoints for image generation: 🧵
-> /flux-pro-1.1-ultra
-> /flux-pro-1.1
-> /flux-pro
-> /flux-dev

Each ID has 50 free credits available for use, based on the cost per image sample generated by the model.

.
.
.
@prithivMLmods 🤗
reacted to their post with ❤️ 7 days ago
view post
Post
4572
Quintet Drop : : 🤗

{ Flux LoRA DLC ⛵ } : prithivMLmods/FLUX-LoRA-DLC

-- Purple Dreamy
{ pop of color } : prithivMLmods/Purple-Dreamy-Flux-LoRA

-- Golden Dust
{ shimmer contrast } : prithivMLmods/Golden-Dust-Flux-LoRA

-- Lime Green
{ depth to the composition } : prithivMLmods/Lime-Green-Flux-LoRA

-- Flare Strike
{ Fractured Line } : prithivMLmods/Fractured-Line-Flare

-- Orange Chroma
{ studio lighting } : prithivMLmods/Orange-Chroma-Flux-LoRA
.
.
.
{ collection } : prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

@prithivMLmods