New ArrivalsHealth & WellnessValentine’s DayClothing, Shoes & AccessoriesHomeKitchen & DiningGroceryHousehold EssentialsFurnitureOutdoor Living & GardenBabyToysVideo GamesElectronicsMovies, Music & BooksBeautyPersonal CareGift IdeasParty SuppliesCharacter ShopSports & OutdoorsBackpacks & LuggageSchool & Office SuppliesPetsUlta Beauty at TargetTarget OpticalGift CardsBullseye’s PlaygroundDealsClearanceTarget New Arrivals Target Finds #TargetStyleHanukkahStore EventsAsian-Owned Brands at TargetBlack-Owned or Founded Brands at TargetLatino-Owned Brands at TargetWomen-Owned Brands at TargetLGBTQIA+ ShopTop DealsTarget Circle DealsWeekly AdShop Order PickupShop Same Day DeliveryRegistryRedCardTarget CircleFind Stores
Build a Text-To-Image Generator (from Scratch) - by  Mark Liu (Paperback) - 1 of 1

Build a Text-To-Image Generator (from Scratch) - by Mark Liu (Paperback)

$59.99

Out of Stock

Eligible for registries and wish lists

Sponsored

About this item

Highlights

  • Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.
  • About the Author: Dr. Mark Liu is a tenured finance professor and the founding director of the Master of Science in Finance program at the University of Kentucky.
  • 360 Pages
  • Computers + Internet, Computer Vision & Pattern Recognition

Description



Book Synopsis



Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.

This book takes you step-by-step through creating your own AI models that can generate images from text. You'll explore two methods of image generation--vision transformers and diffusion models--and learn vital AI development techniques as you go.

Dive into the powerful models behind AI image generators. The best way to learn is to build something from scratch, and in this book you'll build your very own diffusion model and vision transformer. As you work through each stage of development, you'll develop an understanding of how these models can be customized, applied, and integrated for impressive multimodal AI.

Build a Text-to-Image Generator (from Scratch) teaches you how to:

- Build and train models to generate high resolution images based on text descriptions
- Edit an existing image based on text prompts
- Build and train a model to add captions to images
- Build and train a vision transformer to classify images
- Fine-tune LLMs for downstream tasks such as classification, text or image generation
- Better differentiate real images from deepfakes

About the technology

AI-generated images appear everywhere from high-end advertising to casual social media feeds. Text-to-image tools like Dall-e, Midjourney, and Flux make it easy to create AI art, but how do they work? In this book, you'll find out by building your own text-to-image generator!

About the book

Build a Text-to-Image Generator (from Scratch) explores both transformer-based image generation and diffusion models. You'll work hands-on to build a pair of simple generation models that can classify images, automatically add captions, reconstruct images, and enhance existing graphics. Author Mark Liu guides you every step of the way with clear explanations, informative diagrams, and eye-opening examples you can build on your own laptop.

What's inside

- Build a vision transformer to classify images
- Edit images using text prompts
- Fine-tune image models

About the reader

Requires basic knowledge of generative AI models and intermediate Python skills.

About the author

Mark Liu is the founding director of the Master of Science in Finance program at the University of Kentucky. He is also the author of Learn Generative AI with PyTorch.

Table of Contents

Part 1
1 A tale of two models: Transformers and diffusions
2 Build a transformer
3 Classify images with a vision transformer
4 Add captions to images
Part 2
5 Generate images with diffusion models
6 Control what images to generate in diffusion models
7 Generate high-resolution images with diffusion models
Part 3
8 CLIP: A model to measure the similarity between image and text
9 Text-to-image generation with latent diffusion
10 A deep dive into Stable Diffusion
Part 4
11 VQGAN: Convert images into sequences of integers
12 A minimal implementation of DALL-E
Part 5
13 New developments and challenges in text-to-image generation
A Installing PyTorch and enabling GPU training locally and in Colab



About the Author



Dr. Mark Liu is a tenured finance professor and the founding director of the Master of Science in Finance program at the University of Kentucky. He has more than 20 years of coding experience, a Ph.D. in finance from Boston College.
Dimensions (Overall): 9.25 Inches (H) x 7.38 Inches (W)
Weight: .92 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 360
Genre: Computers + Internet
Sub-Genre: Computer Vision & Pattern Recognition
Publisher: Manning Publications
Format: Paperback
Author: Mark Liu
Language: English
Street Date: December 30, 2025
TCIN: 1006354382
UPC: 9781633435421
Item Number (DPCI): 247-03-0351
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.

Shipping details

Estimated ship dimensions: 1 inches length x 7.38 inches width x 9.25 inches height
Estimated ship weight: 0.92 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO

Return details

This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.
See the return policy for complete information.

Related Categories

Get top deals, latest trends, and more.

Privacy policy

Footer

About Us

About TargetCareersNews & BlogTarget BrandsBullseye ShopSustainability & GovernancePress CenterAdvertise with UsInvestorsAffiliates & PartnersSuppliersTargetPlus

Help

Target HelpReturnsTrack OrdersRecallsContact UsFeedbackAccessibilitySecurity & FraudTeam Member ServicesLegal & Privacy

Stores

Find a StoreClinicPharmacyTarget OpticalMore In-Store Services

Services

Target Circle™Target Circle™ CardTarget Circle 360™Target AppRegistrySame Day DeliveryOrder PickupDrive UpFree 2-Day ShippingShipping & DeliveryMore Services
PinterestFacebookInstagramXYoutubeTiktokTermsCA Supply ChainPrivacy PolicyCA Privacy RightsYour Privacy ChoicesInterest Based AdsHealth Privacy Policy