
Player FM 앱으로 오프라인으로 전환하세요!
Deploy and fine-tune LLM models on Kubernetes using KAITO
Manage episode 433011321 series 3332465
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
- https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
- https://github.blog/news-insights/product-news/introducing-github-models/
Show links:
- Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
- https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
- https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
- Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
- https://paulyu.dev/article/soaring-with-kaito/
- Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models
- Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
- Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
- Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/
Timestamps:
- 00:02:15 Cloud Native News
- 00:05:34 Interview with Sachi and Paul
- 00:42:08 Key takeaways
88 에피소드
Manage episode 433011321 series 3332465
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
- https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
- https://github.blog/news-insights/product-news/introducing-github-models/
Show links:
- Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
- https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
- https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
- Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
- https://paulyu.dev/article/soaring-with-kaito/
- Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models
- Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
- Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
- Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/
Timestamps:
- 00:02:15 Cloud Native News
- 00:05:34 Interview with Sachi and Paul
- 00:42:08 Key takeaways
88 에피소드
모든 에피소드
×플레이어 FM에 오신것을 환영합니다!
플레이어 FM은 웹에서 고품질 팟캐스트를 검색하여 지금 바로 즐길 수 있도록 합니다. 최고의 팟캐스트 앱이며 Android, iPhone 및 웹에서도 작동합니다. 장치 간 구독 동기화를 위해 가입하세요.