2026
VISTA: Open-Vocabulary, Task-Relevant Robot Exploration With Online Semantic Gaussian Splatting

VISTA: Open-Vocabulary, Task-Relevant Robot Exploration With Online Semantic Gaussian Splatting

VISTA simultaneously performs semantic and geometric information gain next-best-view gathering in order to efficiently create 3D reconstructions that are salient to a language-based instruction.

#mapping #semantic search #next best view
SketchPlan: Diffusion Based Drone Planning From Human Sketches

SketchPlan: Diffusion Based Drone Planning From Human Sketches

We propose SketchPlan, a diffusion-based planner that interprets 2D hand-drawn sketches over depth images to generate 3D flight paths for drone navigation.

#planning #machine learning #Gaussian Splatting
SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones

SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones

SINGER is a recipe for generalist language guided visuomotor navigation for drones.

#imitation learning #Gaussian Splatting #VLA
SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation

SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation

SARM is a stage-aware, video-based reward modeling framework that enables scalable and robust imitation learning for long-horizon tasks by deriving progress signals from natural language annotations, dramatically improving policy performance over standard behavior cloning.

#Imitation Learning #Reward Modeling #Robotics Manipulation
Rethinking the Primitives: Next Generation LLM Architecture

Rethinking the Primitives: Next Generation LLM Architecture

A layer-by-layer redesign of the Transformer stack. Starting from the outermost layer — how position is encoded — and working inward through attention mechanisms, linear hybrid architectures, sparse expert routing, and finally normalization. Each work finds the hidden mathematical structure of one component and replaces engineering convention with a principled derivation.

#machine learning #LLM #reasoning
Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation

Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation

Fuses VLM physical priors with interactive online adaptation to enable zero-shot manipulation of objects with unknown physical properties.

#reinforcement learning #manipulation #uncertainty quantification
Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel

Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel

We propose KERN (Kernel Inspired Router with Normalization), a zero-additional-cost FFN-style router for MoE models. By connecting MoE to Nadaraya-Watson regression, we show that both FFN and MoE are special cases of kernel regression — motivating KERN as a principled alternative to Softmax routing. Experiments across MoE and LLM settings validate its effectiveness.

#machine learning #LLM #reasoning
Coverage Optimization for Camera View Selection

Coverage Optimization for Camera View Selection

Next best view selection can be formulated as a coverage optimization problem, leading to interpretable, reliable, and real-time performance.

#uncertainty quantification #computer vision #3D reconstruction
π, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation

π, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation

A study on how well manipulator pretrained VLAs transfer to aerial manipulators.

#Vision Language Action Model #Cross Embodiment Transfer #Unmanned Aerial Manipulators (UAMs)
2025
Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics

Learning Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics

GRaD-Nav utilizes 3DGS and differentiable dynamics to train visual drone navigation policy efficiently.

#aerial #Gaussian Splatting #reinforcement learning
Safe Real-Time Robot Navigation in Gaussian Splatting Maps

Safe Real-Time Robot Navigation in Gaussian Splatting Maps

Splat-Nav perform efficient and scalable collision avoidance and pose estimation with Gaussian Splatting environments.

#Gaussian Splatting #planning #safety
A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps

A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps

SAFER-Splat is a fast and scalable safety filter that operates simultaneously with real-time Gaussian Splatting training, enabling safe robot motions from vision.

#Gaussian Splatting #safety #planning
Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps

Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps

Splat-Nav is a unified real-time safe planning and pose estimator built on a 3D Gaussian Splatting map.

#safety #state estimation #Gaussian Splatting
SAS: Simulated Attention Score

SAS: Simulated Attention Score

The attention mechanism is a core component of the Transformer architecture. Various methods have been developed to compute attention scores, including multi-head attention (MHA), multi-query attention, group-query attention and so on. We introduce Simulated Attention Score (SAS), which maintains a compact model size while simulating a larger number of attention heads and hidden feature dimension per head. To control the parameter cost, we also propose Parameter-Efficient Attention Aggregation (PEAA).

#LLM #reasoning #architecture
SAFER-Splat: Safety with Control Barrier Functions in Online Gaussian Splatting Maps

SAFER-Splat: Safety with Control Barrier Functions in Online Gaussian Splatting Maps

An algorithm for real-time, dynamic collision avoidance in your incremental 3D Gaussian Splatting mapping pipeline.

#safety #Gaussian Splating #control
HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting

HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting

HAMMER is a real-time multi-robot 3D Gaussian Splatting mapping pipeline for heterogeneous robot fleets.

#multi robot #mapping #Gaussian Splatting
GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics

GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics

A light-weighted onboard drone flight VLA framework that based on Differentiable RL and 3DGS.

#Reinforcement learning #vision based navigation #aerial systems: perception and autonomy
Conformal Safety Monitoring for Flight Testing: A Case Study in Data-Driven Safety Learning

Conformal Safety Monitoring for Flight Testing: A Case Study in Data-Driven Safety Learning

We use stochastic trajectory simulation to learn a calibrated statistical model of the short-term safety risk facing test pilots.

#statistics #uncertainty quantification #safety
Semantic-Metric Bayesian Risk Fields: Learning Robot Safety from Human Videos with a VLM Prior

Semantic-Metric Bayesian Risk Fields: Learning Robot Safety from Human Videos with a VLM Prior

We propose a human-aligned Bayesian risk model for quantifying safety in in-the-wild scenarios, beyond settings like collision avoidance.

#safety #computer vision
ARCH: Hierarchical Hybrid Learning for Long-Horizon Contact-Rich Robotic Assembly

ARCH: Hierarchical Hybrid Learning for Long-Horizon Contact-Rich Robotic Assembly

We propose ARCH (Adaptive Robotic Compositional Hierarchy) for long-horizon, contact-rich robotic assembly. ARCH combines a low-level primitive library of RL and model-based skills with a high-level imitation learning policy that selects and parameterizes primitives from a handful of demonstrations, enabling generalizable high-precision assembly.

#machine learning #planning
2020
Game Theoretic Planning for Autonomous Driving

Game Theoretic Planning for Autonomous Driving

When interaction and negotiation between vehicles is key, an autonomous driving car should reason about the influence of its decisions over the surrounding cars' trajectories. We propose a fast planner that accounts for the game-theoretic interactions between vehicles.

#game theory #autonomous vehicle #planning
Neural Network Reachability

Neural Network Reachability

We develop a novel method for computing exact forward and backward reachable sets of a neural network with ReLU activation.

#reachability #machine learning #safety
Distributed Multi-Target Tracking for Autonomous Vehicle Fleets

Distributed Multi-Target Tracking for Autonomous Vehicle Fleets

We present a distributed algorithm enabling fleets of autonomous vehicles to track multiple targets within their environments while communicating locally with their neighbors.

#multi robot #autonomous vehicle #state estimation
Wildfire Control and Estimation

Wildfire Control and Estimation

We study algorithms for modelling, estimating, and controlling wildfires using multi-robot teams.

#multi robot #reinforcement learning #planning
Game Theoretic Planning for Autonomous Drone Racing

Game Theoretic Planning for Autonomous Drone Racing

How should robots plan to compete? We propose a game theoretic planner which allows drones to reason online about their opponents' actions during race scenarios.

#aerial #game theory #planning
Shape-Changing Truss Robots

Shape-Changing Truss Robots

We introduce an truss robot that is composed primarily of inflated fabric tubes, and is capable of locomotion and object manipulation.

#locomotion #planning #control
Planning for Human-Robot Interaction

Planning for Human-Robot Interaction

We combine risk-sensitive optimal control tools with deep generative modeling to enable robotic crowd navigation, and observe diverse interaction behavior by varying the robot's risk sensitivity.

#planning #safety #autonomous vehicle
Adaptive Control for Collaborative Manipulation

Adaptive Control for Collaborative Manipulation

How can robots work together to move unknown payloads? Using tools from adaptive control, we develop a distributed algorithm for collaborative manipulation without prior payload knowledge.

#manipulation #multi robot #control