Multimodal Foundation Models

Multimodal Foundation Models

From Specialists to General-Purpose Assistants

Versandkostenfrei!
Versandfertig in 1-2 Wochen
84,99 €
inkl. MwSt.
PAYBACK Punkte
42 °P sammeln!
This monograph presents a comprehensive survey of the taxonomy and evolution of multimodal foundation models that demonstrate vision and vision-language capabilities, focusing on the transition from specialist models to general-purpose assistants. The focus encompasses five core topics, categorized into two classes; (i) a survey of well-established research areas: multimodal foundation models pre-trained for specific purposes, including two topics - methods of learning vision backbones for visual understanding and text-to-image generation; (ii) recent advances in exploratory, open research are...