Large Vision-Language Models (eBook, PDF)

Pre-training, Prompting, and Applications

Editors: Zhou, Kaiyang; Gao, Peng; Liu, Ziwei
Available for immediate download
€136.95
incl. VAT
Rapid progress in large multimodal foundation models, especially vision-language models, has dramatically transformed machine learning, computer vision, and natural language processing. Trained on vast amounts of multimodal data combining images and text, these models have demonstrated remarkable capabilities in tasks ranging from image classification and object detection to visual content generation and question answering. This book provides a comprehensive and up-to-date exploration of large vision-language models, covering the key aspects of thei...