PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency Paper • 2410.07563 • Published Oct 10 • 2
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated Oct 3 • 25
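A minimal usage sketch for the collection above, assuming the instruction-tuned checkpoint id google/gemma-2-2b-jpn-it (the exact repository id is an assumption) and the Hugging Face transformers chat-style text-generation pipeline:

```python
# Minimal sketch: run a Japanese prompt through a Gemma 2 JPN checkpoint.
# The model id below is an assumption about the collection's instruction-tuned variant.
from transformers import pipeline

chat = pipeline(
    "text-generation",
    model="google/gemma-2-2b-jpn-it",  # assumed checkpoint id
    device_map="auto",                 # optional; requires accelerate
)

messages = [{"role": "user", "content": "日本の首都はどこですか？"}]  # "What is the capital of Japan?"
out = chat(messages, max_new_tokens=64)

# With chat-format input, generated_text is the message list including the reply.
print(out[0]["generated_text"][-1]["content"])
```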
MS MARCO Mined Triplets Collection These datasets contain MS MARCO Triplets gathered by mining hard negatives using various models. Each dataset has various subsets. • 14 items • Updated May 21 • 10
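The collection above is built by mining hard negatives; the following is a minimal sketch of that general technique, not the collection's actual mining pipeline. It assumes sentence-transformers, an arbitrary dense retriever, and a toy corpus: passages that score highly for a query but are not its labeled positive are kept as hard negatives, forming (query, positive, negative) triplets.

```python
# Minimal hard-negative mining sketch for (query, positive) pairs.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")  # any dense retriever works

queries = ["what is the capital of japan"]
positives = ["Tokyo is the capital of Japan."]
corpus = [
    "Tokyo is the capital of Japan.",
    "Kyoto was the former capital of Japan.",
    "Mount Fuji is the highest mountain in Japan.",
]

q_emb = model.encode(queries, convert_to_tensor=True)
c_emb = model.encode(corpus, convert_to_tensor=True)

triplets = []
for qi, hits in enumerate(util.semantic_search(q_emb, c_emb, top_k=3)):
    for hit in hits:  # hits are sorted by decreasing similarity
        passage = corpus[hit["corpus_id"]]
        if passage != positives[qi]:
            # Highest-scoring non-positive passage = hardest negative for this query.
            triplets.append((queries[qi], positives[qi], passage))
            break

print(triplets)
```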
Qwen2.5 Collection Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 45 items • Updated Sep 18 • 382
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs Paper • 2407.03963 • Published Jul 4 • 15
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities Paper • 2404.17790 • Published Apr 27 • 5
Building a Large Japanese Web Corpus for Large Language Models Paper • 2404.17733 • Published Apr 27 • 4
JaColBERT and Hard Negatives, Towards Better Japanese-First Embeddings for Retrieval: Early Technical Report Paper • 2312.16144 • Published Dec 26, 2023 • 3