about tokenizer padding

by mmjwxbc - opened 24 days ago

24 days ago

rna_tokens = rna_tokenizer(train_rnas, padding=True, truncation=True, return_tensors='pt', max_length=50)['input_ids']

why this instruction can not pad my token

MultiMolecule org 23 days ago

The tokenizer auto pads all sequences to the longest sequence in a batch.
max_length applies to truncation only.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment