๐Ÿค” How to make AI models smarter for you without making them bigger?

Iโ€™m taking Berkeleyโ€™s CS294 ๐™‡๐™‡๐™ˆ ๐˜ผ๐™œ๐™š๐™ฃ๐™ฉ๐™จ course, and I particularly enjoyed a recent lecture from Burak Gokturk where he explained several approaches that will help enterprises get significantly better results from foundation models.


๐Ÿ”ง ๐—ง๐˜‚๐—ป๐—ถ๐—ป๐—ด/๐——๐—ถ๐˜€๐˜๐—ถ๐—น๐—น๐—ฎ๐˜๐—ถ๐—ผ๐—ป
โ€ข Tuning is customizing a model based on specific data and/or use case
This ranges from techniques as basic as prompt design (providing few shot examples as part of your prompt), to actual retraining of the model (either full [very costly] retraining, or more efficiently with a concept like โ€œLoRAโ€).
โ€ข Distillation is creating a smaller model for improved cost/latency, using a teacher/student pattern, where your foundation model will โ€œteachโ€ (generate training data and reward/penalise) a smaller (student) model.

Not every use case needs a model that can โ€œanswer all the questions in the worldโ€. Often, a smaller, specialized model is more effective.


โš“ ๐—š๐—ฟ๐—ผ๐˜‚๐—ป๐—ฑ๐—ถ๐—ป๐—ด
โ€ข Combine with search to make it factual (also known as โ€œRAGโ€, Retrieval Augmented Generation)
โ€ข Can be applied to web data or enterprise data

Think of this as making LLMs more reliable by connecting them to verified information sources (for example post training cutoff date), or sources that matter to your context (your enterprise data).


๐Ÿ› ๏ธ ๐—˜๐˜…๐˜๐—ฒ๐—ป๐˜€๐—ถ๐—ผ๐—ป๐˜€/๐—™๐˜‚๐—ป๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—–๐—ฎ๐—น๐—น๐—ถ๐—ป๐—ด
Like the โ€œPhone a Friendโ€ option in โ€œWho Wants to Be a Millionaire,โ€ this involves 2 main complexities:
โ€ข Recognizing when the model needs external help
โ€ข Selecting the right tool for the task

For example, a LLM canโ€™t book flights directly, but through function calling, it could use an Expedia extension to complete this task.


As a fractional CDO, I love exploring how these concepts can address real business challenges in real world AI implementations.

If youโ€™re interested in the course, you can check it out here.

Leave a Comment

Your email address will not be published. Required fields are marked *