Center for Language & Speech Processing(CLSP), JHU
Takeaways from the SCALE 2024 Workshop on Video-based Event Retrieval
Explore multilingual event-centric video retrieval techniques, focusing on non-professional content. Learn about dataset creation, modality-specific improvements, and key findings in text extraction and fusion.