Katharine Jarmul on using Python for data analysis

O'Reilly Programming Podcast - O'Reilly Media Podcast

O'Reilly Media에서 제공하는 콘텐츠입니다. 에피소드, 그래픽, 팟캐스트 설명을 포함한 모든 팟캐스트 콘텐츠는 O'Reilly Media 또는 해당 팟캐스트 플랫폼 파트너가 직접 업로드하고 제공합니다. 누군가가 귀하의 허락 없이 귀하의 저작물을 사용하고 있다고 생각되는 경우 여기에 설명된 절차를 따르실 수 있습니다 https://ko.player.fm/legal.

8y ago 26:17

MP3•에피소드 홈

The O’Reilly Programming Podcast: Wrangling data with Python’s libraries and packages.

In this episode of the O’Reilly Programming Podcast, I talk with Katharine Jarmul, a Python developer and data analyst whose company, Kjamistan, provides consulting and training on topics surrounding machine learning, natural language processing, and data testing. Jarmul is the co-author (along with Jacqueline Kazil) of the O’Reilly book Data Wrangling with Python, and she has presented the live online training course Practical Data Cleaning with Python.

Discussion points:

How data wrangling enables you to take real-world data and “clean it, organize it, validate it, and put it in some format you can actually work with,” says Jarmul.
Why Python has become a preferred language for use in data science: Jarmul cites the accessibility of the language and the emergence of packages such as NumPy, pandas, SciPy, and scikit-learn.
Jarmul calls pandas “Excel on steroids” and says, “it allows you to manipulate tabular data, and transform it quite easily. For anyone using structured, tabular data, you can’t go wrong with doing some part of your analysis in pandas.”
She cites gensim and spaCy as her favorite NLP Python libraries, praising them for “the ability to just install a library and have it do quite a lot of deep learning or machine learning tasks for you.”

Other links:

Check out the video Building Data Pipelines with Python, presented by Jarmul.
Check out the video Data Wrangling and Analysis with Python, presented by Jarmul.
Jarmul is one of the founders of the group PyLadies, which focuses on helping more women become active participants and leaders in the Python open source community.

25 에피소드

#Tech #Society #Business #Tech News #At OReilly #OReilly Media