Join our new short course, Efficiently Serving Large Language Models, to build a ground-up understanding of how to serve LLM applications from Travis Addair, CTO at Predibase. Whether you’re ready to launch your own application or just getting started building it, the topics you’ll explore in this course will deepen your foundational knowledge of how LLMs work, and help you better understand the performance trade-offs you must consider when building LLM applications that will serve large numbers of users.



What you'll learn
- Learn how large language models (LLMs) generate text by repeatedly predicting the next token, and how techniques like KV caching can greatly speed up that generation.
- Write code that serves LLM applications efficiently, balancing per-request output speed against serving many users at once.
- Explore the fundamentals of Low Rank Adapters (LoRA) and see how Predibase builds its LoRAX inference server framework to serve multiple fine-tuned models at once.
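To make the KV caching idea from the first point concrete, here is a minimal sketch in plain Python. It is not code from the course: the single attention head, the 2-dimensional toy weights (`W_Q`, `W_K`, `W_V`), and the names `KVCache` and `last_output_no_cache` are all assumptions chosen for illustration. It shows that storing each token's key and value once gives the same attention output as recomputing them for the whole sequence at every step.

```python
import math

# Hypothetical toy projection matrices (2x2); a real model learns these.
W_Q = [[1.0, 0.0], [0.0, 1.0]]
W_K = [[0.5, -0.5], [0.25, 1.0]]
W_V = [[1.0, 2.0], [-1.0, 0.5]]


def project(x, matrix):
    """Multiply row vector x by a matrix given as a list of rows."""
    return [
        sum(x[i] * matrix[i][j] for i in range(len(x)))
        for j in range(len(matrix[0]))
    ]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    peak = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [
        sum(w * v[i] for w, v in zip(weights, values)) / total
        for i in range(dim)
    ]


def last_output_no_cache(embeddings):
    """Without a cache: re-project every token's key and value each step."""
    keys = [project(x, W_K) for x in embeddings]
    values = [project(x, W_V) for x in embeddings]
    return attend(project(embeddings[-1], W_Q), keys, values)


class KVCache:
    """With a cache: project only the newest token, reuse earlier keys/values."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, x):
        self.keys.append(project(x, W_K))
        self.values.append(project(x, W_V))
        return attend(project(x, W_Q), self.keys, self.values)


# Hypothetical token embeddings standing in for a generated sequence.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]

cache = KVCache()
for t, x in enumerate(tokens):
    cached = cache.step(x)
    recomputed = last_output_no_cache(tokens[: t + 1])
    assert all(math.isclose(a, b) for a, b in zip(cached, recomputed))
```

In this sketch the uncached path performs one key and one value projection per token in the sequence at every step, while the cached path performs one of each per step regardless of sequence length. The trade-off is memory: the cache grows with every generated token, which is one reason serving many users at once has to be balanced against per-request speed.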
Details to know
Only available on desktop

Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks

How you'll learn
- Hands-on, project-based learning - Practice new skills by completing job-related tasks with step-by-step instructions. 
- No downloads or installation required - Access the tools and resources you need in a cloud environment. 
- Available only on desktop - This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices. 





Frequently asked questions
In Projects, you'll complete an activity or scenario by following a set of instructions in an interactive hands-on environment. Projects are completed in a real cloud environment and within real instances of various products as opposed to a simulation or demo environment.
By purchasing a Project, you'll get everything you need to complete the Project including temporary access to any product required to complete the Project.
Even though Projects are technically available on mobile devices, we highly recommend that you complete Projects on a laptop or desktop only.

