No need to worry about downloading a database management system! We will be using DB Fiddle.
Workshop Presentations
Data and slides from each workshop will be posted here.
Joins in PostgreSQL: Explanation of Joins in PostgreSQL
Data Sources
Kaggle: Kaggle is a dataset repository, data science competition host, tutorial provider, and more. It has an active community that discusses and contributes solutions to various data science problems.
DATA.GOV: A home for U.S. government open data. Topics include climate, education, finance, and many more.
UCI Machine Learning Repository: The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. This project is in collaboration with Rexa.info at the University of Massachusetts Amherst and receives funding support from the National Science Foundation.
Fall 2020 workshop information now available at:
NBL Workshop Calendar - https://libcal.rutgers.edu/nblworkshops
Data Science Basics: Introduction to SQL for Database Querying (Beginner)
Thursday | November 12, 2020 | 3:00pm ET (online) --- Workshop Recording
This workshop will be a beginner level introduction to the fundamentals of the database language SQL. We will be using basic query statements and pattern matching to access data about Haas avocado prices (or another freely available alternative).
Data Science Basics: Beyond Basics in SQL (Beginner)
Thursday | November 19, 2020 | 3:00pm ET (online) --- Workshop Recording
This workshop will be a beginner level introduction to summarizing and joining data using SQL. We will be querying from a dataset about Wish eCommerce products and sales (or another freely available alternative).