robotics-diffusion-transformer committed
Commit f03ec58 • Parent: ba0ed2f
Update README.md

README.md CHANGED
@@ -1,29 +1,36 @@
 ---
 license: mit
+language:
+- en
 ---
 # RDT-1B

 RDT-1B is a 1B-parameter imitation learning Diffusion Transformer pre-trained on 1M+ multi-robot episodes. Given a language instruction and 3-view RGB image observations, RDT can predict the next
 64 robot actions. RDT is inherently compatible with almost all kinds of modern mobile manipulators, from single-arm to dual-arm, joint to EEF, pos. to vel., and even with a mobile chassis.

-All the code and model weights are licensed under MIT license.
+All the [code]() and pretrained model weights are licensed under the MIT license.

-Please refer to our [project page](
+Please refer to our [project page](https://rdt-robotics.github.io/rdt-robotics/) and [paper]() for more information.

 ## Model Details

-- **Developed by**
+- **Developed by:** RDT Team from Tsinghua University.
 - **License:** MIT
-- **
-- **
-
-- **Repository:** [
-- **Paper :** [
+- **Language(s) (NLP):** en
+- **Model Architecture:** Diffusion Transformer.
+- **Pretraining dataset:** A curated pretraining dataset collected from 46 datasets. Please see [here]() for details.
+- **Repository:** [repo_url]
+- **Paper:** [paper_url]
 - **Project Page:** https://rdt-robotics.github.io/rdt-robotics/

 ## Uses

-RDT
+RDT takes a language instruction, image observations, and proprioception as input, and predicts the next 64 robot actions in the form of a unified action-space vector
+that covers the main physical quantities of a robot: end-effector and joint positions and velocities, base movement, etc.
+
+### Getting Started
+
+RDT-1B supports fine-tuning on custom datasets, deployment and inference on real robots, as well as pretraining the model.

 Please refer to [our repository](https://github.com/GeneralEmbodiedSystem/RoboticsDiffusionTransformer/blob/main/docs/pretrain.md) for all the above guides.

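The Uses section added above specifies RDT's I/O contract: a language instruction, three RGB views, and proprioception go in, and a chunk of 64 future actions comes out. Purely as an illustration of that contract, here is a minimal sketch with dummy data; the function name, tensor layouts, and the 128-wide action vector are placeholder assumptions, not the repository's actual inference API (see the linked repository for the real guides).

```python
# Illustrative only: the function name, tensor layouts, and the 128-dim action
# width below are assumptions for exposition, not the repository's actual API.
import numpy as np

def predict_actions(instruction: str,
                    images: np.ndarray,   # (3, H, W, 3): three RGB views
                    proprio: np.ndarray,  # (D,): current robot state
                    horizon: int = 64,
                    action_dim: int = 128) -> np.ndarray:
    """Stand-in for the real model call: returns a chunk of `horizon` actions."""
    # A real deployment would encode `instruction` and `images`, condition the
    # diffusion transformer on `proprio`, and run the reverse-diffusion sampler;
    # here we only return a zero placeholder with the advertised output shape.
    assert images.shape[0] == 3, "RDT is conditioned on 3 RGB views"
    return np.zeros((horizon, action_dim), dtype=np.float32)

# Example call with dummy observations.
obs_images = np.zeros((3, 480, 640, 3), dtype=np.uint8)
obs_state = np.zeros(128, dtype=np.float32)
actions = predict_actions("pick up the red block", obs_images, obs_state)
print(actions.shape)  # (64, 128): 64 future actions, one unified vector each
```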
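The phrase "unified action space vector" can be made concrete with a toy packing scheme: each physical quantity owns a fixed slot range in one fixed-length vector, and a robot that lacks a quantity simply leaves its slots masked out. The slot layout and the 128-dimensional width below are assumptions chosen for illustration, not RDT's actual specification.

```python
# Toy "unified action space": every robot fills only the slots it has; a mask
# records which entries are meaningful. Slot layout and the 128-dim width are
# illustrative assumptions, not RDT's actual specification.
import numpy as np

UNIFIED_DIM = 128
SLOTS = {  # hypothetical slot ranges for a few physical quantities
    "right_arm_joint_pos": slice(0, 7),
    "right_gripper_pos": slice(7, 8),
    "left_arm_joint_pos": slice(8, 15),
    "left_gripper_pos": slice(15, 16),
    "base_velocity": slice(16, 19),  # vx, vy, yaw rate
}

def pack(quantities: dict) -> tuple:
    """Pack whatever one robot reports into a fixed-length vector plus mask."""
    vec = np.zeros(UNIFIED_DIM, dtype=np.float32)
    mask = np.zeros(UNIFIED_DIM, dtype=bool)
    for name, value in quantities.items():
        vec[SLOTS[name]] = value
        mask[SLOTS[name]] = True
    return vec, mask

# A single-arm robot without a mobile base fills only its own slots.
vec, mask = pack({
    "right_arm_joint_pos": np.zeros(7, dtype=np.float32),
    "right_gripper_pos": np.array([0.04], dtype=np.float32),
})
print(vec.shape, int(mask.sum()))  # (128,) 8
```

This is one way to read the compatibility claim above: the vector layout stays fixed across single-arm, dual-arm, and mobile-base robots, and only the filled slots change.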