shijie xia
seven-cat
AI & ML interests
LLMs
Recent Activity
upvoted a paper 36 minutes ago
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
upvoted a paper about 1 month ago
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
upvoted a paper 2 months ago
RoboOmni: Proactive Robot Manipulation in Omni-modal Context