Data Collection
Cultural Analytics Open Science Guide
Login to your student account
Introduction
1
Datasets
Version Control
2
The Command Line
3
README file
4
General Workflow of a Version Control System (VCS)
5
Git and GitHub
6
Create a GitHub Account
7
Github Flow
8
The
git commit
Command
9
Retrieving and Comparing Versions
10
Branches and Git
11
Merging Branches in Git
12
Interactive, Visual Git
13
Git Commands to Work on GitHub
14
The Clone Wars
15
Git Summary
16
Version Control for Data
Data Analysis (Pandas)
17
Pandas Basics — Part 1
18
Pandas Basics Part 1 — Workbook
19
Pandas Basics — Part 2
20
Pandas Basics Part 2 — Workbook
21
Pandas Basics — Part 3
22
Pandas Basics Part 3 — Workbook
23
Pandas Basics — More Useful Methods — Workbook
24
Pandas — Merge Datasets
Data Collection
25
Users’ Data: Legal & Ethical Considerations
26
Web Scraping — Part 1
27
Web Scraping Part 1 — Workbook
28
Web Scraping — Part 2
29
Web Scraping — Part 2 — Workbook
30
Application Programming Interfaces (APIs)
31
APIs — Workbook
Data collection examples
32
Song Genius Data Collection
33
Song Genius API
34
Song Lyrics Collection
35
Song Lyrics Analysis
36
Twitter Data
37
Twitter API Setup
38
Twitter Data Collection & Analysis
39
Twitter Data Sharing
40
Reddit Data Collection and Analysis with PSAW
41
100 Historical Spanish Newspaper Editions
Exploratory Data Analysis (in-depth)
42
Data Types and Quality of the Data
43
Tips for recognizing the data types
44
Categorical Variable Data Type
45
Data Information
46
Exploratory Data Analysis and Summary Statistics
47
Association between variables
Data Collection
Code
This series of lessons will focus on how to collect cultural data from the internet:
24
Pandas — Merge Datasets
25
Users’ Data: Legal & Ethical Considerations