arxiv:2512.19535
Amelie Royer
ameroyer
AI & ML interests
Computer Vision, Domain Adaptation, Conditional Architectures
Recent Activity
authored
a paper
about 19 hours ago
Moshi: a speech-text foundation model for real-time dialogue
authored
a paper
about 19 hours ago
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
upvoted
a
paper
2 days ago
Vision-Speech Models: Teaching Speech Models to Converse about Images