AaRon

AARon99

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
some1nostr/Ostrich-70B
liked a Space about 1 month ago
huggingface/paper-central
liked a model about 2 months ago
rhymes-ai/Aria
View all activity

Organizations

None yet

AARon99's activity

liked a Space about 1 month ago
New activity in mattshumer/Reflection-Llama-3.1-70B 3 months ago

Changes made in tensors

2
#22 opened 3 months ago by leafspark
updated a collection 4 months ago
Reacted to Xenova's post with 🔥 4 months ago
view post
Post
7890
Introducing Whisper Diarization: Multilingual speech recognition with word-level timestamps and speaker segmentation, running 100% locally in your browser thanks to 🤗 Transformers.js!

Tested on this iconic Letterman interview w/ Grace Hopper from 1983!
- Demo: Xenova/whisper-speaker-diarization
- Source code: Xenova/whisper-speaker-diarization
  • 1 reply
·
upvoted an article 4 months ago
view article
Article

After 500+ LoRAs made, here is the secret

By FPHam
8
Reacted to Csplk's post with 🧠 4 months ago
view post
Post
1379
# Offensive Physical Security Reconnaissance Planning Automation with public facing RTSP streams and Moondream


After some late night casual hacking about on VLMs for criminal attack vector reconnaissance automaton experiments using Moondream (as usual) based image-text-text with pre defined text prompts that are tuned for extracting weakness or customer identity and monitory based theft physical red team engagement reconnaissance and vector of malicious or criminal activity Working on a space. Thanks again for such a wonderful blessing of super power image-text-to-text model with minimal computational power needed @vikhyatk

I have started actually implementing a custom little tool with both static html space sand python gradio spaces on the go which I shall share as hf spaces when done them.

---

vikhyatk/moondream2

vikhyatk/moondream2
  • 1 reply
·
updated a collection 5 months ago
replied to WizardLM's post 7 months ago
view reply

I really appreciate your contributions to the open source community. I don't understand why if it were just the model that had an issue, why the blog, github, and all of your models have been removed?

Reacted to mrfakename's post with 🔥 8 months ago
view post
Post
4118
Today, I'm excited to launch two new models on the TTS Arena: MeloTTS and StyleTTS 2. Both are open sourced, permissively licensed, and highly efficient.

Curious to see how they compare with other leading models? Vote on the TTS Arena ⬇️

TTS-AGI/TTS-Arena

MeloTTS, released by MyShell AI, provides realistic and lifelike text to speech while remaining efficient and fast, even when running on CPU. It supports a variety of languages, including but not limited to English, French, Chinese, and Japanese.

StyleTTS 2 is another fully open sourced text to speech framework. It's permissively licensed, highly-efficient, and supports voice cloning and longform narration. It also provides natural and lifelike speech.

Both are available now to try on the TTS Arena - vote to find which one is better! The leaderboard will be revealed once we collect enough votes.
  • 12 replies
·