Demystifying Large Language Models

James Chen

ISBN: 9781738908479

Hardcover | 346 pagina's | 26 april 2024

€ 33.99

This book is a comprehensive guide aiming to demystify the world of transformers -- the architecture that powers Large Language Models (LLMs) like GPT and BERT. From PyTorch basics and mathematical foundations to implementing a Transformer from scratch, you'll gain a deep understanding of the inner workings of these models.That's just the beginning. Get ready to dive into the realm of pre-training your own Transformer from scratch, unlocking the power of transfer learning to fine-tune LLMs for your specific use cases, exploring advanced techniques like PEFT (Prompting for Efficient Fine-Tuning) and LoRA (Low-Rank Adaptation) for fine-tuning, as well as RLHF (Reinforcement Learning with Human Feedback) for detoxifying LLMs to make them aligned with human values and ethical norms.Step into the deployment of LLMs, delivering these state-of-the-art language models into the real-world, whether integrating them into cloud platforms or optimizing them for edge devices, this section ensures you're equipped with the know-how to bring your AI solutions to life.Whether you're a seasoned AI practitioner, a data scientist, or a curious developer eager to advance your knowledge on the powerful LLMs, this book is your ultimate guide to mastering these cutting-edge models. By translating convoluted concepts into understandable explanations and offering a practical hands-on approach, this treasure trove of knowledge is invaluable to both aspiring beginners and seasoned professionals.

Lees meer

Lees minder

Details

ISBN: 9781738908479
Auteur(s): James Chen
Prijs: € 33.99
Verschenen: 26 april 2024
Taal: Engels
Aantal pagina's: 346
Bindwijze: Hardcover
Uitgever: James Chen
Afmetingen: 229 x 152 x 27 mm
Gewicht: 635 g

Thema

Computers & Informatica

James Chen

Details

Thema

Beschikbaar als