prithivMLmods (Prithiv Sakthi)

reacted to pagezyhf's post with 👍 about 3 hours ago

Post

317

Hello Hugging Face Community,

I'd like to share here a bit more about our Deep Learning Containers (DLCs) we built with Google Cloud, to transform the way you build AI with open models on this platform!

With pre-configured, optimized environments for PyTorch Training (GPU) and Inference (CPU/GPU), Text Generation Inference (GPU), and Text Embeddings Inference (CPU/GPU), the Hugging Face DLCs offer:

⚡ Optimized performance on Google Cloud's infrastructure, with TGI, TEI, and PyTorch acceleration.
🛠️ Hassle-free environment setup, no more dependency issues.
🔄 Seamless updates to the latest stable versions.
💼 Streamlined workflow, reducing dev and maintenance overheads.
🔒 Robust security features of Google Cloud.
☁️ Fine-tuned for optimal performance, integrated with GKE and Vertex AI.
📦 Community examples for easy experimentation and implementation.
🔜 TPU support for PyTorch Training/Inference and Text Generation Inference is coming soon!

Find the documentation at https://huggingface.co/docs/google-cloud/en/index
If you need support, open a conversation on the forum: https://discuss.huggingface.co/c/google-cloud/69

reacted to LukeNeumann's post with 🔥 about 13 hours ago

Post

636

Hello Hugging Face community!

I wanted to introduce myself and my company @Overlaiapp . We are a collective of filmmakers, photographers, and AI engineers working on high resolution (8K+) training data.

We plan to share a lot of our datasets with the community and are kicking things off with two curated datasets:

- Overlaiai/OregonCoastin4K

- Overlaiai/SubArcticPolarBear

Overlai.ai Dataset Features

🎥 Oversampled: Every clip is captured in stunning 8K resolution, delivering rich detail ideal for fine tuning scenic landscapes and ocean dynamics.

📸 Variance: Includes close-up details, slow-motion footage of crashing waves, sweeping landscapes, and wildlife shots.

📋 Detailed Metadata: Every clip is paired with structured metadata, including creative descriptions, precise camera movements, lens information, field of view calculations, and shot settings, ensuring AI models can fully understand and replicate real-world cinematography with accuracy.

⚙️ Consistency: Re-thinking training data at the point of capture by "overshooting" a subject, enabling models to learn more nuanced relationships and views across scenes.

🌅 Light: Shot during early morning and sunset light for optimal color contrast and dynamic range, maximizing visual quality for color and lighting-sensitive tasks.

🔍 Curation: Curated specifically for machine learning, providing clean, high-quality data for next generation model training.

posted an update about 24 hours ago

Post

477

Animaker / Anime Engine : 🤗

Only on🚀: https://huggingface.co/strangerzonehf

Demo: prithivMLmods/FLUX-LoRA-DLC

Adapters:
->Model 1: strangerzonehf/Flux-Animeo-v1-LoRA
->Model 2: strangerzonehf/Flux-Animex-v2-LoRA

Collections:
->Flux LoRA Collection: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be
->Stranger Zones: prithivMLmods/stranger-zone-collections-6737118adcf2cb40d66d0c7e
->LoRA Spaces: prithivMLmods/lora-space-collections-6714b72e0d49e1c97fbd6a32
.
.
.
@prithivMLmods

reacted to AdinaY's post with 🔥 1 day ago

Post

1345

Let’s dive into the exciting releases from the Chinese community last week 🔥🚀
More details 👉 https://huggingface.co/zh-ai-community

Code model:
✨Qwen 2.5 coder by Alibaba Qwen
Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f
✨OpenCoder by InflyAI - Fully open code model🙌
infly/opencoder-672cec44bbb86c39910fb55e

Image model:
✨Hunyuan3D-1.0 by Tencent
tencent/Hunyuan3D-1

MLLM:
✨JanusFlow by DeepSeek
deepseek-ai/JanusFlow-1.3B
deepseek-ai/JanusFlow-1.3B
✨Mono-InternVL-2B by OpenGVlab
OpenGVLab/Mono-InternVL-2B

Video model:
✨CogVideoX 1.5 by ChatGLM
THUDM/CogVideoX1.5-5B-SAT

Audio model:
✨Fish Agent by FishAudio
fishaudio/fish-agent-v0.1-3b

Dataset:
✨OPI dataset by BAAIBeijing
BAAI/OPI

reacted to merve's post with 🤗 2 days ago

Post

1492

Microsoft released LLM2CLIP: a CLIP model with longer context window for complex text inputs 🤯
All models with Apache 2.0 license here microsoft/llm2clip-672323a266173cfa40b32d4c

TLDR; they replaced CLIP's text encoder with various LLMs fine-tuned on captioning, better top-k accuracy on retrieval.
This will enable better image-text retrieval, better zero-shot image classification, better vision language models 🔥
Read the paper to learn more: LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation (2411.04997)

reacted to louisbrulenaudet's post with 🤗 2 days ago

Post

1484

I’ve published a new to simplify model merging 🤗

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai’s mergekit, streamlining the automation of high-performance merge searches 📖

Dataset : louisbrulenaudet/mergekit-configs

reacted to BlinkDL's post with 🔥 3 days ago

Post

2331

RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)

posted an update 3 days ago

Post

3063

Funding realism back into the world. 🍺

🔥Stranger Zone's : Super Realism [ strangerzonehf/Flux-Super-Realism-LoRA ]

Demo1 : prithivMLmods/FLUX-LoRA-DLC
Demo2 : prithivMLmods/FLUX-REALISM

Other New adapt.s for dev 🍔 [ updated patch 2 ]

Hosted -> prithivMLmods/FLUX-LoRA-DLC

🍟Digital Chaos: prithivMLmods/Digital-Chaos-Flux-LoRA
🍟Threaded Knitted: prithivMLmods/Knitted-Character-Flux-LoRA
🍟Fashion Hut: prithivMLmods/Fashion-Hut-Modeling-LoRA
🍟Aura 9999: prithivMLmods/Aura-9999
🍟Green Cartoon: prithivMLmods/Green-Cartoon-Flux-LoRA
🍟Pastels: prithivMLmods/Pastel-BG-Flux-LoRA
🍟Retro Pixel: prithivMLmods/Retro-Pixel-Flux-LoRA
🍟CAnime: prithivMLmods/CAnime-LoRA
🍟Pastels: prithivMLmods/Pastel-BG-Flux-LoRA

------------

🎉LoRA Collection: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🎉LoRA Spaces: prithivMLmods/lora-space-collections-6714b72e0d49e1c97fbd6a32

🎉Collection Zero:
prithivMLmods/collection-zero-and-demo-recently-updated-65e48a7dd8212873836ceca2
------------
.
.
.
@prithivMLmods 🤗

reacted to ariG23498's post with 🚀 4 days ago

Post

2450

Qwen/qwen25-66e81a666513e518adb90d9e

Qwen/Qwen2.5-Coder-Artifacts

Qwen/Qwen2.5-Coder-demo

posted an update 4 days ago

Post

3394

New adapt.s for dev 🍔

Hosted -> prithivMLmods/FLUX-LoRA-DLC

✨Teen Outfit: prithivMLmods/Teen-Outfit
✨Dark Pink: prithivMLmods/Dark-Thing-Flux-LoRA
✨Shadow Projection: prithivMLmods/Shadow-Projection-Flux-LoRA
✨Abstract Cartoon: prithivMLmods/Abstract-Cartoon-Flux-LoRA
✨Street Bokeh: prithivMLmods/Street-Bokeh-Flux-LoRA
✨Fine Detailed: prithivMLmods/Flux-Realism-FineDetailed
✨Bold Shadows: prithivMLmods/Bold-Shadows-Flux-LoRA
✨Yellow Laser: prithivMLmods/Yellow-Laser-Flux-LoRA

------------

🎉LoRA Collection: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🎉LoRA Spaces: prithivMLmods/lora-space-collections-6714b72e0d49e1c97fbd6a32

------------
.
.
.@prithivMLmods 🤗

reacted to Dref360's post with 🤝 4 days ago

Post

2203

Sharing a new space to test out ViTPose, a pose estimation model using Visual Transformers.

[ViTPose Playground]( Dref360/vit_pose_playground)

This model will be available in transformers once [#30530](https://github.com/huggingface/transformers/pull/30530) is merged. Huge shoutout to @nielsr and @danelcsb for bringing this to HF!

Here's the result on my Ken Halloween costume.

2 replies

·

reacted to their post with 🔥 5 days ago

Post

3954

GRID-6X : Layout for Seamless Image Assembly 🔥

🪨Demo: prithivMLmods/GRID-6X
🪨Doc / Blog: https://huggingface.co/blog/prithivMLmods/grid-6x

In the infer function:

grid_img = Image.new('RGB', (width * grid_size_x, height * grid_size_y))
for i, img in enumerate(result.images[:num_images]):
    grid_img.paste(img, (i % grid_size_x * width, i // grid_size_x * height))

1. **Image Initialization**: grid_img is a blank canvas that will hold the images in a grid format.
2. **Image Placement**: Images are pasted onto the canvas using a loop:
- **Horizontal Position**: (i % grid_size_x) * width calculates the x-coordinate.
- **Vertical Position**: (i // grid_size_x) * height calculates the y-coordinate.

1. **Grid Size Selection**: The user selects the grid size from options like "2x1", "1x2", "2x2", "2x3", "3x2", and "1x1". Each option corresponds to the arrangement of images:
- **2x1**: 2 images in a single row
- **1x2**: 1 image in two rows (column layout)
- **2x2**: 2 rows with 2 images each
- **2x3**: 2 rows with 3 images each
- **3x2**: 3 rows with 2 images each
- **1x1**: A single image (default)

2. **Image Generation**: Based on the grid size selection, the app calculates the number of images to generate. For example:
- If the grid size is "2x2", the app generates 4 images.
- For "3x2", it generates 6 images.

-> Each option arranges images accordingly, providing flexibility in viewing multiple images in one output.

-> Added both of these spaces that support the GRID functionality Layout for Seamless Image Assembly :

----------
🔥IMAGINEO-4K: prithivMLmods/IMAGINEO-4K

🔥GRID-6X: prithivMLmods/GRID-6X
----------
.
.
.@prithivMLmods 🤗

reacted to their post with ❤️ 5 days ago

Post

4822

Style flo : : 🎉🤗

{ Try Now on Flux LoRA DLC ⛵ } : prithivMLmods/FLUX-LoRA-DLC

-- Undersea
{ Red Fluid } : prithivMLmods/Red-Undersea-Flux-LoRA

-- 3D Realmix
{ 3D Portrait Render } : prithivMLmods/3D-Render-Flux-LoRA

-- Pop
{ Yellow Pop } : prithivMLmods/Yellow-Pop-Flux-Dev-LoRA

-- Grid
{ Purple Grid } : prithivMLmods/Purple-Grid-Flux-LoRA

{ collections : : }

🚀 Flux LoRA :
prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🚀Collection zero: prithivMLmods/collection-zero-and-demo-recently-updated-65e48a7dd8212873836ceca2

.
.
@prithivMLmods 🧨

posted an update 5 days ago

Post

3954

GRID-6X : Layout for Seamless Image Assembly 🔥

🪨Demo: prithivMLmods/GRID-6X
🪨Doc / Blog: https://huggingface.co/blog/prithivMLmods/grid-6x

In the infer function:

grid_img = Image.new('RGB', (width * grid_size_x, height * grid_size_y))
for i, img in enumerate(result.images[:num_images]):
    grid_img.paste(img, (i % grid_size_x * width, i // grid_size_x * height))

1. **Image Initialization**: grid_img is a blank canvas that will hold the images in a grid format.
2. **Image Placement**: Images are pasted onto the canvas using a loop:
- **Horizontal Position**: (i % grid_size_x) * width calculates the x-coordinate.
- **Vertical Position**: (i // grid_size_x) * height calculates the y-coordinate.

1. **Grid Size Selection**: The user selects the grid size from options like "2x1", "1x2", "2x2", "2x3", "3x2", and "1x1". Each option corresponds to the arrangement of images:
- **2x1**: 2 images in a single row
- **1x2**: 1 image in two rows (column layout)
- **2x2**: 2 rows with 2 images each
- **2x3**: 2 rows with 3 images each
- **3x2**: 3 rows with 2 images each
- **1x1**: A single image (default)

2. **Image Generation**: Based on the grid size selection, the app calculates the number of images to generate. For example:
- If the grid size is "2x2", the app generates 4 images.
- For "3x2", it generates 6 images.

-> Each option arranges images accordingly, providing flexibility in viewing multiple images in one output.

-> Added both of these spaces that support the GRID functionality Layout for Seamless Image Assembly :

----------
🔥IMAGINEO-4K: prithivMLmods/IMAGINEO-4K

🔥GRID-6X: prithivMLmods/GRID-6X
----------
.
.
.@prithivMLmods 🤗

reacted to their post with 👀👍🔥 6 days ago

Post

5528

New Style, New Mix, New Drop 🧤

🧨Flux LoRA DLC: prithivMLmods/FLUX-LoRA-DLC

🎆Glowing-Body: prithivMLmods/Glowing-Body-Flux-LoRA
🎆Electric-Blue: prithivMLmods/Electric-Blue-Flux-LoRA
🎆Intense-Red: prithivMLmods/Intense-Red-Flux-LoRA
🎆Clouds-Illusion: prithivMLmods/Clouds-Illusion-Flux-LoRA
🎆Digital-Yellow: prithivMLmods/Digital-Yellow-Flux-LoRA

🧨Flux LoRA Collection: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

.
.
.
@prithivMLmods

posted an update 6 days ago

Post

4822

Style flo : : 🎉🤗

{ Try Now on Flux LoRA DLC ⛵ } : prithivMLmods/FLUX-LoRA-DLC

-- Undersea
{ Red Fluid } : prithivMLmods/Red-Undersea-Flux-LoRA

-- 3D Realmix
{ 3D Portrait Render } : prithivMLmods/3D-Render-Flux-LoRA

-- Pop
{ Yellow Pop } : prithivMLmods/Yellow-Pop-Flux-Dev-LoRA

-- Grid
{ Purple Grid } : prithivMLmods/Purple-Grid-Flux-LoRA

{ collections : : }

🚀 Flux LoRA :
prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

🚀Collection zero: prithivMLmods/collection-zero-and-demo-recently-updated-65e48a7dd8212873836ceca2

.
.
@prithivMLmods 🧨

reacted to their post with ❤️ 7 days ago

Post

3074

FLUX 1.1 [pro] Ultra : API - { 4x Higher Image Resolutions }
Upto - 4 Megapixels, 10 seconds per sample. { Hi - Res }

{ Blog Post ⛵ } : https://huggingface.co/blog/prithivMLmods/flux-pro-endpoint

Endpoint Creation Step by Step: 🧵
-> Sign up to { api.bfl.ml } & get your api's: https://api.bfl.ml/auth/profile
-> File Structure:
flux_image_generation/
├── .env
├── generate_image.py
└── requirements.txt

-> Step 0: Add Your API Key to an Environment File
{ .env }

BFL_API_KEY=your_actual_api_key_here

-> Step 1: Install Required Libraries
{ requirements.txt }

requests
python-dotenv

-> Step 2: Setup the Python Script
{ generate_image.py} - https://github.com/PRITHIVSAKTHIUR/Flux-API/blob/main/generate_image.py

-> Step3: Install the requirements & Run the Script

pip install -r requirements.txt

python generate_image.py

-> Polling: The script polls the API every 0.5 seconds until the image generation result is ready. That's it the script also checks for a successful response after submitting the request.

For more visit:
🔺for script: https://github.com/PRITHIVSAKTHIUR/Flux-API/tree/main
🔺bfl doc: https://docs.bfl.ml/quick_start/gen_image/#__tabbed_1_2

Endpoints for image generation: 🧵
-> /flux-pro-1.1-ultra
-> /flux-pro-1.1
-> /flux-pro
-> /flux-dev

Each ID has 50 free credits available for use, based on the cost per image sample generated by the model.

.
.
.
@prithivMLmods 🤗