Gutscheinbedingungen

**Gültig vom 15.06.2026 bis 17.06.2026 | Gültig für nicht preisgebundene fremdsprachige Bücher | Einzelne Artikel können ausgeschlossen sein | Maximaler rabattfähiger Warenkorbwert 500 € | Nicht kombinierbar mit weiteren Aktionen | Nur einmal pro Person einlösbar | Nur solange der Vorrat reicht

Produktbild: Large Vision-Language Models
- 10%

Large Vision-Language Models Pre-training, Prompting, and Applications

10% sparen

171,99 € UVP 192,59 €

inkl. gesetzl. MwSt., Versandkostenfrei

Lieferung nach Hause

Beschreibung

Produktdetails

Einband

Gebundene Ausgabe

Erscheinungsdatum

31.08.2025

Abbildungen

XVII, 1 illus., schwarz-weiss Illustrationen

Herausgeber

Kaiyang Zhou + weitere

Verlag

Springer

Seitenzahl

429

Maße (L/B/H)

24,1/16/3 cm

Gewicht

832 g

Sprache

Englisch

ISBN

978-3-031-94968-5

Beschreibung

Portrait

Kaiyang Zhou is an Assistant Professor at the Department of Computer Science, Hong Kong Baptist University, working on computer vision and machine learning. He has published more than 30 technical papers in top-tier journals and conferences in relevant fields, including CVPR, ICCV, ECCV, NeurlPS, ICLR, ICML, AAAI, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), and International Journal of Computer Vision (IJCV), with over 10,000 citations received in total. He is an Associate Editor of IJCV, the flagship journal in computer vision, and regularly serves as area chair and senior program committee for top-tier computer vision and machine learning conferences, such as NeurIPS, CVPR, ECCV, and AAAI.

Ziwei Liu is an Associate Professor at Nanyang Technological University, Singapore. His research interests include computer vision, machine learning, and computer graphics. He has published extensively with top-tier conferences and journals in relevant fields, including CVPR, ICCV, ECCV, NeurlPS, ICLR, ICML, IEEE Transactions on Pattern Analysis and Machine Intelligence, ACM Transactions on Graphics and Nature - Machine Intelligence. He is the recipient of ICCV Young Researcher Award, HKSTP Best Paper Award, CVPR Best Paper Award Candidate, ICBS Frontiers of Science Award and MIT Technology Review Innovators under 35 Asia Pacific. He serves as an area chair of CVPR, ICCV, ECCV, NeurlPS and ICLR, as well as an associate editor of International Journal of Computer Vision. 

Peng Gao is a research scientist at Shanghai Artificial Intelligence Laboratory, working on large language models and vision-language models. His research interests include vision-language models, large language models and diffusion models for contents creation. He has published more than 40 papers in top-tier journals and conferences, including International Journal of Computer Vision (IJCV), ICML, ICLR, NeurIPS, CVPR, ICCV and ECCV, receiving more than 10,000 citations. He has led several influential open-source projects including LLaMa-Adapter and the Lumina series, receiving more than 7000 and 2000 stars, respectively. 

Produktdetails

Einband

Gebundene Ausgabe

Erscheinungsdatum

31.08.2025

Abbildungen

XVII, 1 illus., schwarz-weiss Illustrationen

Herausgeber

Verlag

Springer

Seitenzahl

429

Maße (L/B/H)

24,1/16/3 cm

Gewicht

832 g

Sprache

Englisch

ISBN

978-3-031-94968-5

Herstelleradresse

Springer-Verlag GmbH
Tiergartenstr. 17
69121 Heidelberg
DE

Email: ProductSafety@springernature.com

Kundinnen und Kunden meinen

0 Bewertungen

Informationen zu Bewertungen

Zur Abgabe einer Bewertung ist eine Anmeldung im Konto notwendig. Die Authentizität der Bewertungen wird von uns nicht überprüft. Wir behalten uns vor, Bewertungstexte, die unseren Richtlinien widersprechen, entsprechend zu kürzen oder zu löschen.

Die Bewertungen sind nach Format, Anzahl Sterne und Datum sortiert.

Verfassen Sie die erste Bewertung zu diesem Artikel

Helfen Sie anderen Kund*innen durch Ihre Meinung

Kundinnen und Kunden meinen

0 Bewertungen filtern

  • Produktbild: Large Vision-Language Models
  • Part 1: Pre-training and Datasets.- Chapter 1: LAION-5B: A Massive Open Image-Text Dataset.- Chapter 2: Efficient Training of Large-Scale Vision-Language Models.- Chapter 3: Scaling Laws for Contrastive Language-Image Learning.- Chapter 4: Scaling Up Vision-Language Models for Generic Tasks.- Chapter 5: Searching for Next-Gen Multimodal Datasets.- Part 2: Prompting and Generalization.- Chapter 6: Soft Prompt Learning for Vision-Language Models.- Chapter 7: Unified Prompting for Vision and Language.- Chapter 8: Zero-Shot Image Classification with Custom Prompts.- Chapter 9: Enhancing Vision-Language Models with Feature Adapters.- Chapter 10: Automatic Optimization of Prompting Architectures.- Chapter 11: Open-Vocabulary Calibration for VL Models.- Part 3: Applications.- Chapter 12: Open-Vocabulary DETR with Conditional Matching.- Chapter 13: Extracting Dense Labels from CLIP.- Chapter 14: PointCLIP: Understanding Point Clouds with VL.- Chapter 15: Diffusion-Based Relation Inversion from Images.- Chapter 16: Text-to-Video Generation.- Chapter 17: Text-Driven Human Motion Generation.- Chapter 18: Zero-Shot Text-Driven 3D Avatar Generation.- Chapter 19: Zero-Shot Text-Driven HDR Panorama Generation.