LLM-jp-3 Fine-tuned Models Collection Fine-tuned models in the LLM-jp-3 model series • 5 items • Updated 7 days ago • 1
LLM-jp-3 Pre-trained Models Collection Pre-trained models in the LLM-jp-3 model series • 5 items • Updated 7 days ago • 1
How to generate text: using different decoding methods for language generation with Transformers Article • Mar 1, 2020 • 111 (a minimal decoding sketch appears after this list)
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency Paper • 2410.07563 • Published Oct 10 • 2
gemma-2-baku Collection The baku model series is based on the gemma-2 series and has been continually pre-trained on Japanese-specific corpora. • 4 items • Updated Oct 3 • 3
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated Oct 3 • 25
Japanese SimCSE Collection Tsukagoshi et al., Japanese SimCSE Technical Report, arXiv 2023. https://arxiv.org/abs/2310.19349 • 5 items • Updated Sep 4 • 2
llama-3-youko Collection The youko model series is based on the llama-3 series and has been continually pre-trained on Japanese-specific corpora. • 9 items • Updated Sep 30 • 1
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26 • 55
Sarashina Collection Large Language Models developed by SB Intuitions • 7 items • Updated 14 days ago • 2
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs Paper • 2407.03963 • Published Jul 4 • 15
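The decoding article listed above compares greedy search, beam search, and nucleus sampling as implemented in the Transformers `generate()` API. A minimal sketch of those three options is below; the model choice (`gpt2`, the model used in that article) and the prompt are illustrative assumptions, not part of any collection above.

```python
# Sketch of the decoding methods from the "How to generate text" article.
# Model ("gpt2") and prompt are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("Japanese language models are", return_tensors="pt")

# Greedy decoding: always pick the highest-probability next token.
greedy = model.generate(**inputs, max_new_tokens=30, do_sample=False)

# Beam search: keep the num_beams most likely partial sequences at each step.
beam = model.generate(**inputs, max_new_tokens=30, num_beams=5, early_stopping=True)

# Nucleus (top-p) sampling: sample from the smallest token set whose
# cumulative probability exceeds top_p.
sampled = model.generate(
    **inputs, max_new_tokens=30, do_sample=True, top_p=0.92, temperature=0.8
)

for name, ids in [("greedy", greedy), ("beam", beam), ("top-p", sampled)]:
    print(name, tokenizer.decode(ids[0], skip_special_tokens=True))
```

The same `generate()` arguments apply unchanged to the Japanese models in these collections once the corresponding checkpoint name is substituted.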