Tolga Topal
Paperback

What Fuels Transformers in Computer Vision? Unraveling ViT's Advantages

18,95 €
incl. VAT
Master's Thesis from the year 2022 in the subject Computer Sciences - Artificial Intelligence, grade: 7.50, Universidad de Alcalá, course: Artificial Intelligence and Deep Learning, language: English, abstract: Vision Transformers (ViT) are neural model architectures that compete with and exceed classical convolutional neural networks (CNNs) in computer vision tasks. ViT's versatility and performance are best understood by proceeding with a backward analysis. In this study, we aim to identify, analyse and extract the key elements of ViT by backtracking on the origin of Transformer neural architectu...