2026-04-05 / 2604.02327
Steerable Visual Representations
2026-04-05 / 2604.02215
Universal Hypernetworks for Arbitrary Models
2026-04-05 / 2604.01860
POCO: Posterior Optimization with Clipped Objective for Bridging Efficiency and Stability in Generative Policy Learning
2026-04-05 / 2604.01681
Bridging Large-Model Reasoning and Real-Time Control via Agentic Fast-Slow Planning
2026-04-05 / 2604.01605
F3DGS: Federated 3D Gaussian Splatting for Decentralized Multi-Agent World Modeling
2026-04-05 / 2604.01577
Thinking While Listening: Fast-Slow Recurrence for Long-Horizon Sequential Modeling
2026-04-05 / 2604.01570
Boosting Vision-Language-Action Finetuning with Feasible Action Neighborhood Prior
2026-04-05 / 2604.01567
AnchorVLA: Anchored Diffusion for Efficient End-to-End Mobile Manipulation
2026-04-04 / 2604.02292
Taming the Exponential: A Fast Softmax Surrogate for Integer-Native Edge Inference
2026-04-04 / 2604.02051
Ouroboros: Dynamic Weight Generation for Recursive Transformers via Input-Conditioned LoRA Modulation
2026-04-04 / 2604.01985
World Action Verifier: Self-Improving World Models via Forward-Inverse Asymmetry
2026-04-04 / 2604.01765
DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning
2026-04-02 / 2603.29844
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
2026-04-02 / 2603.29535
Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge
2026-04-02 / 2603.29409
CLaD: Planning with Grounded Foresight via Cross-Modal Latent Dynamics
2026-04-02 / 2603.29090
HCLSM: Hierarchical Causal Latent State Machines for Object-Centric World Modeling
2026-04-02 / 2603.29078
PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression
2026-04-02 / 2603.28955
Enhancing Policy Learning with World-Action Model
2026-04-02 / 2603.27914
ITQ3_S: Interleaved Ternary Quantization with TurboQuant — High-Fidelity 3-bit LLM Inference via Rotation-Domain Adaptive Quantization
2026-04-01 / 2604.01001
EgoSim: Egocentric World Simulator for Embodied Interaction Generation
2026-04-01 / 2603.27402
A 64-Spin All-to-All CMOS Ising Machine with Landscape Perturbation Achieving 2.28 nJ/Edge-Bit Energy-to-Solution
2026-04-01 / 2603.22078
Do World Action Models Generalize Better than VLAs? A Robustness Study
2026-04-01 / 2603.18532
Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds
2026-04-01 / 2603.14498
R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation
2026-04-01 / 2602.04037
DADP: Domain Adaptive Diffusion Policy
2026-03-31 / skywork-matrix-game-3
Matrix-Game 3.0
2026-03-31 / prismml-bonsai-1bit-8b
1-bit Bonsai 8B
2026-03-31 / anthropic-emotions-2026
Emotion Concepts and their Function in a Large Language Model
2026-03-31 / 2603.26599
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward
2026-03-31 / 2603.26425
CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities
2026-03-31 / 2603.26360
Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate
2026-03-31 / 2603.26320
DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching
2026-03-30 / 2603.25725
SoftMimicGen: Data Generation System for Scalable Robot Learning in Deformable Object Manipulation
2026-03-30 / 2603.25661
Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance
2026-03-30 / 2603.25385
GlowQ: Group-Shared Low-Rank Approximation for Quantized LLMs
2026-03-30 / 2603.25284
SliderQuant: Accurate Post-Training Quantization for LLMs via Sliding-Layer Window Design
2026-03-30 / 2603.25038
pi, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation
2026-03-30 / 2603.24806
FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions
2026-03-29 / 2603.25716
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
2026-03-29 / 2603.25685
Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning
2026-03-29 / 2603.25544
Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale
2026-03-29 / 2603.25406
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
2026-03-29 / 2603.25399
LaMP: Learning Vision-Language-Action Policies with 3D Scene Flow as Latent Motion Prior
2026-03-29 / 2603.24916
Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML
2026-03-29 / 2603.19312
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
2026-03-29 / 2603.16673
When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making