Player FM 앱으로 오프라인으로 전환하세요!
The database for all your AI needs
Manage episode 506779303 series 3579868
Marcel Kornacker, the creator of Apache Impala and co-creator of Apache Parquet, joins me to talk about his latest project: Pixeltable, a multimodal AI database that combines structured and unstructured data with rich, Python-native workflows.
From ingestion to vector search, transcription to snapshots, Pixeltable eliminates painful data plumbing for modern AI teams.
Follow Marcel
- Pixeltable: https://pixeltable.com
- Pixeltable GitHub: https://github.com/pixeltable/pixeltable
- LinkedIn: https://www.linkedin.com/in/marcelkornacker
Follow Aaron
- Twitter: https://twitter.com/aarondfrancis
- LinkedIn: https://www.linkedin.com/in/aarondfrancis
- Website: https://aaronfrancis.com – find articles, podcasts, courses, and more
- Database School: https://databaseschool.com
Chapters
- 0:00 – Introduction
- 0:20 – Meet Marcel Kornacker
- 1:19 – Early career and grad school in databases
- 2:12 – Joining Google and building F1
- 3:42 – How F1 used Spanner at Google
- 4:01 – Starting Apache Impala at Cloudera
- 6:02 – Why SQL still matters
- 7:29 – What keeps Marcel fascinated with databases
- 9:37 – The “SQL is dead” waves and shift to AI
- 10:21 – Observing pain points in computer vision pipelines
- 13:02 – Multimodal data challenges and the idea for Pixeltable
- 16:10 – How Pixeltable handles transformations with computed columns
- 26:29 – Example: processing video, audio, and transcripts in Pixeltable
- 33:12 – DAG execution and parallelism explained
- 37:00 – Transactional guarantees in Pixeltable
- 39:00 – Iterators and chunking data for search
- 42:26 – Using embeddings and semantic search
- 47:05 – Updating data and incremental recomputation
- 50:06 – Thoughts on RAG and hybrid search
- 53:14 – Real-world use cases and dataset curation
- 57:00 – Example: labeling food waste on cruise ships
- 1:02:00 – Labeling workflows and syncing annotations
- 1:02:41 – Pixeltable’s roadmap and cloud vision
- 1:07:10 – How to get involved with Pixeltable
- 1:09:03 – Closing and where to find Marcel
26 에피소드
Manage episode 506779303 series 3579868
Marcel Kornacker, the creator of Apache Impala and co-creator of Apache Parquet, joins me to talk about his latest project: Pixeltable, a multimodal AI database that combines structured and unstructured data with rich, Python-native workflows.
From ingestion to vector search, transcription to snapshots, Pixeltable eliminates painful data plumbing for modern AI teams.
Follow Marcel
- Pixeltable: https://pixeltable.com
- Pixeltable GitHub: https://github.com/pixeltable/pixeltable
- LinkedIn: https://www.linkedin.com/in/marcelkornacker
Follow Aaron
- Twitter: https://twitter.com/aarondfrancis
- LinkedIn: https://www.linkedin.com/in/aarondfrancis
- Website: https://aaronfrancis.com – find articles, podcasts, courses, and more
- Database School: https://databaseschool.com
Chapters
- 0:00 – Introduction
- 0:20 – Meet Marcel Kornacker
- 1:19 – Early career and grad school in databases
- 2:12 – Joining Google and building F1
- 3:42 – How F1 used Spanner at Google
- 4:01 – Starting Apache Impala at Cloudera
- 6:02 – Why SQL still matters
- 7:29 – What keeps Marcel fascinated with databases
- 9:37 – The “SQL is dead” waves and shift to AI
- 10:21 – Observing pain points in computer vision pipelines
- 13:02 – Multimodal data challenges and the idea for Pixeltable
- 16:10 – How Pixeltable handles transformations with computed columns
- 26:29 – Example: processing video, audio, and transcripts in Pixeltable
- 33:12 – DAG execution and parallelism explained
- 37:00 – Transactional guarantees in Pixeltable
- 39:00 – Iterators and chunking data for search
- 42:26 – Using embeddings and semantic search
- 47:05 – Updating data and incremental recomputation
- 50:06 – Thoughts on RAG and hybrid search
- 53:14 – Real-world use cases and dataset curation
- 57:00 – Example: labeling food waste on cruise ships
- 1:02:00 – Labeling workflows and syncing annotations
- 1:02:41 – Pixeltable’s roadmap and cloud vision
- 1:07:10 – How to get involved with Pixeltable
- 1:09:03 – Closing and where to find Marcel
26 에피소드
모든 에피소드
×플레이어 FM에 오신것을 환영합니다!
플레이어 FM은 웹에서 고품질 팟캐스트를 검색하여 지금 바로 즐길 수 있도록 합니다. 최고의 팟캐스트 앱이며 Android, iPhone 및 웹에서도 작동합니다. 장치 간 구독 동기화를 위해 가입하세요.