  1. Understanding Chain-of-Thought in LLMs through Information ...

    Jul 10, 2025 · Large Language Models (LLMs) have shown impressive performance in complex reasoning tasks through the use of Chain-of-Thought (CoT) reasoning, allowing models to break …

  2. Diffusion Glancing Transformer for Parallel Sequence to ...

    Nov 29, 2023 · Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

  3. MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal ...

    Feb 13, 2025 · Answering questions with Chain-of-Thought (CoT) has significantly enhanced the reasoning capabilities of Large Language Models (LLMs), yet its impact on Large …

  4. Classification Done Right for Vision-Language Pre-Training

    Nov 5, 2024 · We introduce SuperClass, a super simple classification method for vision-language pre-training on image-text data. Unlike its contrastive counterpart CLIP, which contrasts with a …

  5. Understanding Stragglers in Large Model Training Using What ...

    May 9, 2025 · Large language model (LLM) training is one of the most demanding distributed computations today, often requiring thousands of GPUs with frequent synchronization …

  6. MMaDA: Multimodal Large Diffusion Language Models

    May 21, 2025 · We introduce MMaDA, a novel class of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal …

  7. ByteDance Seed

    Despite the widespread applications of machine learning force fields (MLFFs) in solids and small molecules, there is a notable gap in applying MLFFs to simulate liquid electrolytes—a critical …