Part VII: AI Applications

Multimodal models and real-world LLM applications across industries, from code generation to healthcare to scientific discovery.

"The eye sees only what the mind is prepared to comprehend."

Robertson Davies

Part Overview

Part VII covers the practical application of LLMs across modalities and industries. It opens with multimodal AI (vision, audio, video, documents) and then provides in-depth treatment of LLM applications in software engineering, finance, healthcare, cybersecurity, education, and scientific discovery. Each application domain receives substantial coverage of key techniques, code patterns, case studies, and domain-specific tools.

Chapters: 2 (Chapters 27 and 28). These chapters connect the technical foundations from Parts I through VI to real-world deployment scenarios, domain-specific challenges, and industry best practices.

Big Picture

Language is just one modality. Part VII extends your LLM knowledge to vision, audio, and cross-modal systems, then surveys the rich landscape of LLM-powered applications from code generation to healthcare. Understanding multimodal capabilities prepares you for the convergence of AI modalities that is shaping the next generation of products.

Beyond text: vision-language models, image generation (diffusion, DALL-E, Stable Diffusion), audio and speech models, video generation, and building multimodal applications.

What Comes Next

Continue to Part VIII: Evaluation and Production, where we ensure your LLM systems work reliably with evaluation, observability, and production engineering.