rvachev.orgEN / RU / 🤖
← Back to essays
· Essay · 1 min

Multimodal AI Meta: The Future of Search and Interaction

Meta AI has released an open-source project working with 6 modalities, opening new opportunities for AR/VR.

Multimodal AI Meta: The Future of Search, Generation, and Interaction with the Virtual World in 6 Data Types

Multimodality is gradually permeating our lives.

Meta AI has released an open-source project that works with 6 modalities, allowing not only text, images, and video, but also infrared images and other data, opening up possibilities for working with AR/VR information.

Here are the possibilities this opens up:

- Multimodal search (like Google, but simultaneously across 6 modalities). Example: find a virtual world that has a space the size of a football field, and where there were dancing cats.

- Arithmetic computations with vectors. Previously,