www.cs.columbia.edu/~huy/scalingup/
plan
BC-Z , Jang et al, CoRL 2021.
Diffusion Policy , Chi et al, RSS 2023.
✅ reliably produces robust robot skills
✅ reliably produces robust robot skills
❌ reliance on human demonstration collection
Scalable
Skill
Learning
✅ automatic data collection and policy learning
✅ automatic data collection and policy learning
❌ exploration in sparse reward and long-horizon
Scalable
Skill
Learning
How can we get a
Unified Framework
for
Scalable Skill Learning?
✅ rich manipulation skills
✅ flexibility to novel tasks & domains
sampling-based planners generates diverse robot behavior
sampling-based planners generates diverse robot behavior
(succeeds some of the times)
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
❌
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
❌
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
✅
def bus_is_balanced_on_the_block(state) -> bool:
return is_on_top_of("bus", "block")
✅ Increases success rate
✅ Demonstrates retrying behavior
guide
Google Bard , Google Blog
OpenAI GPT-4 , Unite.AI
TidyBot , Wu et al, IROS 2023
Infinigen , Raistrick et al, CVPR 2023
How we can put robotics on the same scaling trends as large language models while not compromising on robust manipulation and control?
Limitations
Opportunities
www.cs.columbia.edu/~huy/scalingup/