Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions pydata-new-york-city-2023/category.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
{
"title": "PyData New York City 2023"
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
{
"description": "www.pydata.org\n\nhttps://docs.google.com/presentation/d/1Oo_b1AoPASOLV6nyM8TWOvvBEKbxTPyG/edit?usp=drive_link\n\nExplore the intricacies of designing, implementing, and maintaining a production-ready LLM-based self-serve analytics platform. Learn about common pitfalls, essential design considerations, performance evaluation, and robust security measures like protection against prompt injection attacks.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 2338,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
},
{
"label": "https://docs.google.com/presentation/d/1Oo_b1AoPASOLV6nyM8TWOvvBEKbxTPyG/edit?usp=drive_link",
"url": "https://docs.google.com/presentation/d/1Oo_b1AoPASOLV6nyM8TWOvvBEKbxTPyG/edit?usp=drive_link"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/hiYHBjmF2Eg/maxresdefault.jpg",
"title": "Aaditya Bhat - Self-Service Analytics using LLMs | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=hiYHBjmF2Eg"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nWith an average of 3.2 new papers published on Arxiv every day in 2022, causal inference has exploded in popularity, attracting large amount of talent and interest from top researchers and institutions including industry giants like Amazon or Microsoft.\n\nThere\u2019s a very good reason for this upsurge in popularity. In our contemporary data culture we got accustomed to thinking that traditional machine learning methods can provide us with answers to any interesting business or scientific questions.\n\nThis view turns out to be incorrect. Many interesting business and scientific questions are causal in their nature and traditional machine learning methods are not suitable to address them.\n\nIn this talk, dedicated to data scientists and machine learning engineers with at least 3 years of experience, we\u2019ll show why this is the case, we\u2019ll introduce the fundamental tools for causal thinking and show how to translate them into code.\n\nWe\u2019ll discuss a popular use case of churn prevention and demonstrate why only causal models should be used to solve it.\n\nAll in Python, repo included!\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 2559,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/e2Q6r6I4QVA/maxresdefault.jpg",
"title": "Aleksander Molak - A Practical Guide to Causality in Python (For The Perplexed) | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=e2Q6r6I4QVA"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
{
"description": "www.pydata.org\n\nhttps://drive.google.com/file/d/1BFiYjap4t92GR_FtAhHOaK5X_WeiANwX/view?usp=drive_link\n\nAs a community organizer, I have had the privilege of being a part of the Python community for the past ten years.\n\nIn that time, I have seen the community grow and evolve in countless ways. Python evolved from a top 10 language to the top 1 language. Many new people joined the community, new topics as data & AI became part of Python. I'm super proud the preferred language to learn is Python nowadays.\n\nI have also learned a great deal about what it takes to be an effective organizer and how to build and sustain a healthy community. I also experience how not to do it and pulling through in hard times.\n\nI learned a lot about leadership. I learned a lot about myself, my strength and my weaknesses.\nThis helped me also to grow professionally and had a very positive impact on how I work and lead in my day-job as partner in a data and AI consultancy.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 2561,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
},
{
"label": "https://drive.google.com/file/d/1BFiYjap4t92GR_FtAhHOaK5X_WeiANwX/view?usp=drive_link",
"url": "https://drive.google.com/file/d/1BFiYjap4t92GR_FtAhHOaK5X_WeiANwX/view?usp=drive_link"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi_webp/NcO2v6DDH10/maxresdefault.webp",
"title": "Alexander CS Hendorf - Ten Years of Community Organizer | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=NcO2v6DDH10"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
{
"description": "www.pydata.org\n\nhttps://drive.google.com/file/d/1FZ6GgWplGe8j_fKmfqAKNmYmTHQgrEd2/view?usp=drive_link\n\nDeveloping reliable code without writing tests may be a far off dream, but Hypothesis' ghostwriter function will generate tests from type hints. The resulting tests are powerful and often appropriate for data analysis. In this talk, I'll discuss how to add tests to your data analysis code that cover a wide range of inputs -- all while using just a small amount of code.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1840,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://drive.google.com/file/d/1FZ6GgWplGe8j_fKmfqAKNmYmTHQgrEd2/view?usp=drive_link",
"url": "https://drive.google.com/file/d/1FZ6GgWplGe8j_fKmfqAKNmYmTHQgrEd2/view?usp=drive_link"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/4WpJPn0_qXY/maxresdefault.jpg",
"title": "Andy Fundinger - Adventures in not writing tests | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=4WpJPn0_qXY"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nPython promises productivity, GPUs promise performance, but if you ever try to fire up a program on a GPU you will find that it is often slower than a CPU. Over the last decade, the Python ecosystem has embraced GPUs in numerous libraries and techniques. We survey what works with GPUs and some of the libraries that one can use to accelerate the Python workflow on a GPU.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 2182,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/_URmd_ff8HU/maxresdefault.jpg",
"title": "Andy Terrel - The Beauty and the Beast: Python on GPUS | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=_URmd_ff8HU"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
{
"description": "www.pydata.org\n\nhttps://drive.google.com/file/d/1kNBoM5txEUvrx4oaJpuX4vHGBdAh9xPU/view?usp=drive_link\n\nMany data science initiatives fail because of the unavailability of good data.\n\nIn this talk, I go over examples of bad data I\u2019ve encountered in real life projects. I present hypotheses about the reasons that lead to bad data; tools, techniques and patterns to \u201cdebug and correct\u201d bad data; simple workflows for validation, verification and automation to identify bad data using commonly available tools from the PyData and Python ecosystem. Finally, I\u2019ll go over some prescriptive techniques for how you can approach data projects with non-ideal-data to improve odds of success.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 2126,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
},
{
"label": "https://drive.google.com/file/d/1kNBoM5txEUvrx4oaJpuX4vHGBdAh9xPU/view?usp=drive_link",
"url": "https://drive.google.com/file/d/1kNBoM5txEUvrx4oaJpuX4vHGBdAh9xPU/view?usp=drive_link"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/LZU5fO1cE5g/maxresdefault.jpg",
"title": "Avishek Panigrahi - Bad data - anecdotes and examples from the real world | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=LZU5fO1cE5g"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nData scientists strive to bridge the gap between raw data and actionable insights. Yet, the actual value of data lies in its accessibility to non-data experts who can unlock its potential independently. Join us in this hands-on tutorial hosted by experts from Vizzu and Streamlit to discover how to transform data analysis into a dynamic, interactive experience.\n\nStreamlit, celebrated for its user-friendly data app development platform, has recently integrated with Vizzu's ipyvizzu - an innovative open-source data visualization tool that emphasizes animation and storytelling. This collaboration empowers you to craft and share interactive, animated reports and dashboards that transcend traditional static presentations.\n\nTo maximize our learning time, please come prepared by following the setup steps listed at the end of the tutorial description, allowing us to focus solely on skill-building and progress.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 3462,
"language": "eng",
"recorded": "2023-11-01",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/nyc2023/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/dVCvJYfR38k/maxresdefault.jpg",
"title": "Blackwood & Vidos - Creating Interactive, Animated Reports in Streamlit with Vizzu | PyData NYC 2023",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=dVCvJYfR38k"
}
]
}
Loading