Day 3 – More on Python

Wednesday, June 8

9 am–1 pm Pacific Time

Connection Details: Lesson Room Link

Lesson notes:

  Web scraping with Python

  API querying (open PDF in new tab)
  API querying (open Colab notebook in new tab)

  Topic modeling (open slides in new tab)
  Topic modeling (download data)
  Topic modeling (download Jupyter notebook)

Lesson Plan


Web Scraping (9:00–10:15 am)

  • What is web scraping?
  • Introduction to Beautiful Soup
  • Parsing and saving data
  • Using regular expressions
  • To scrape or not to scrape?

Break (10:15–10:20 am)

API Querying (10:20–11:35 am)

  • What is an API?
  • The wide world of web APIs
  • Getting set up
  • Endpoints
  • Querying
  • Formatting and extracting data
  • Authentication

Break (11:35–11:45 am)

Topic Modeling (11:45 am–1:00 pm)

  • What is topic modeling?
  • Reading other people's code
  • Formatting data
  • Building a model
  • Coherence and choosing a model
  • Visualization