Remainder of semester

1. In-class

  1. Tue 12/6:
    1. Tutorial on learning more about GitHub Actions automation
  2. Thu 12/8:
    1. Online tool to create your own package hex sticker
  3. Tue 12/13:
    1. If your vignette doesn’t show up in the list when you run vignette() in the console, see this fix. tldr: you need to run devtools::install(build_vignettes=TRUE)
    2. Complete course evaluations (note on evaluations bias)
    3. Thank yous
    4. Seniors: stay in touch
    5. Final comments

2. Items due

Due Wednesday 12/21 at 3pm, as described on projects page

  1. Package on GitHub
  2. Programming development on Moodle
  3. Peer evaluations Google Form

Lec 23: Thu 12/1

1. Announcements

2. Chalk talk/lecture

  1. Reference R packages Ch 17
  2. Run vignettes(), then vignettes(package = "IPEDS")
  3. README is a quickstart guide vs vignette is an instruction manual
  4. Create vignettes using usethis::use_vignette() wrapper function
  5. Go to RStudio menu -> Tools -> Project Options -> Build Tools -> Make sure “Use roxygen to generate Vignettes” is checked

3. In-class exercise

  1. Start your vignette
  2. See this note about vignettes not showing up when you run vignette(). tldr: you need to run devtools::install(build_vignettes=TRUE)
  3. Group feedback

Lec 22: Tue 11/29

1. Announcements

2. Chalk talk/lecture

3. In-class exercise

4. For next time


Lec 21: Tue 11/22

1. Announcements

  1. Joint SDS/PHY new faculty hire announcement in Slack

2. Chalk talk/lecture

3. In-class exercise

  1. Continuing group feedback, cycling through groups starting with Team Jalopy.

4. For next time

  1. Project Phase II will be graded anytime after Tue 11/22 5pm

Lec 20: Thu 11/17

1. Announcements

  1. Project phase I feedback and grades delivered
  2. Please note that your package must have at least one dataset AND at least one function

2. Chalk talk/lecture

  1. NA

3. In-class exercise

  1. Work on Project Phase II

4. For next time

  1. Project Phase II due Tue 11/22 at 5pm

Lec 19: Thu 11/10

1. Announcements

  1. Office hours today cancelled. If you were planning to come today, please DM me.

2. Chalk talk/lecture

  1. R CMD CHECK
    1. Executive summary
    2. I have no idea what Warning: invalid gid value replaced by that for user 'nobody' means. I encounter it all the time, but it’s never been a problem when I’ve submitted CRAN packages
    3. In abbott demo example, one error found: No tests
  2. Testing
    1. Being pro-active and defensive against bugs/errors
    2. Examples: IPEDS tests that AJ defined
  3. In abbott demo example
    1. Run usethis::use_test("apples")
    2. This creates tests/testthat/test-apples.R
    3. Click “Run Tests”

3. In-class exercise

  1. Finish adding one dataset to R package
  2. Check your package with R CMD CHECK
  3. Add one test to your package, however trivial

4. For next time


Lec 18: Tue 11/8

1. Announcements

  1. Before class starts, ensure you can load these packages

    library(tidyverse)
    library(moderndive)
    library(scales)
    library(janitor)
  2. Added clarification to instructions for Project Phase 2

  3. Start using R packages book as a reference (not a great learning tool IMO). Today’s reference chapter is Chapter 8 on Data.

2. Chalk talk/lecture

a) Example R package data

Example of adding a dataset to an R package: moderndive::DD_vs_SB by Middlebury College ’17 grad Delaney Moran.

  1. Always look at your data first: View(DD_vs_SB)
  2. Numbers are numbers, but data has context: Know your data’s context, here by looking at help file ?DD_vs_SB

b) Steps

  1. Raw data that Middlebury student Delaney Moran scraped from web
  2. Source code data-raw/process_data_sets.R for
    1. Cleaning data
    2. Using it in package via usethis::use_data(). This creates the output data/DD_vs_SB.rda which is a compressed data file that you shouldn’t directly modify.
  3. Source code R/datasets for documenting data set using roxygen2 code. This creates the output man/DD_vs_SB.Rd which is a R help file that you shouldn’t directly modify.

c) Screencast demo

  1. Download apples.csv from Slack
  2. Ensure everytime you build and install package, roxygen2 documentation is updated
  3. Run usethis::use_raw_data()
  4. Build early, build often!
  5. Tricky: creating R/datasets.R documentation file for the first time. I’m going copy and paste this

3. In-class exercise

  1. If you haven’t finished this, do this by the end of today:
    1. Post a link to GitHub repo of your group’s R package in groups Google Sheet. This should have R package structure: DESCRIPTION file, R/ folder, etc.
    2. If a README.Rmd source document doesn’t exist already, create it by running usethis::use_readme_rmd(). Remember to only edit README.Rmd and knit it to create the README.md cover page for your repo
    3. Every group member makes a least one commit, even if only trivial. Every group member should show up on Contributors page
    4. Ensure that Swaha or I can install your package at any time using devtools::install_github(). In other words, the code in the master branch should always work. That way we can do spot-checks for frequent and low-stakes (ungraded) check-ins.
  2. New Add any csv of data.

4. For next time


Lec 17: Thu 11/03

1. Announcements

  1. Check out AJ Haller and Alejandra Munoz Garcia’s IPEDS package:
    1. Available on CRAN as of Nov 1st
    2. Source code on GitHub. See AJ and Alejandra’s contributions by going to GitHub page -> right hand menu -> Contributors

2. Chalk talk/lecture

  1. Initializing local and remote repos/directories
    1. Remote to local (done previously)
    2. Local to remote (other way)
  2. Two ways to install someone else’s GitHub R package locally:
    1. Clone repo locally and “Build” (like Lec10)
    2. Use devtools::install_github() or remotes::install_github()
  3. Screencast recording of initializing git on a local directory and then pushing contents to a new GitHub repo

3. In-class exercise

  1. Testing install_github() on IPEDS
  2. By the end of today:
    1. Post a link to GitHub repo of your group’s R package in groups Google Sheet. This should have R package structure: DESCRIPTION file, R/ folder, etc.
    2. If a README.Rmd source document doesn’t exist already, create it by running usethis::use_readme_rmd(). Remember to only edit README.Rmd and knit it to create the README.md cover page for your repo
    3. Every group member makes a least one commit, even if only trivial. Every group member should show up on Contributors page
  3. By the end of Tuesday:
    1. Ensure that Swaha or I can install your package at any time using devtools::install_github(). In other words, the code in the master branch should always work
    2. That way we can do spot-checks for frequent and low-stakes (ungraded) check-ins.

4. For next time

  1. Final quiz today by 5pm on environments and conditionals

Lec 16: Tue 11/01

1. Announcements

2. Chalk talk/lecture

  1. Conditionals

3. In-class exercise

  1. Lab #12: Environments (not graded)

4. For next time


Lec 15: Thu 10/27

1. Announcements

2. Chalk talk/lecture

  1. Environments

3. In-class exercise

  1. Lab #11: Environments (not graded)

4. For next time

  1. Clarification on different map types
  2. Quiz on functionals due tomorrow 5pm

Lec 14: Tue 10/25

1. Announcements

  1. Discuss Project Phase II

2. Chalk talk/lecture

  1. From Lec 13: Revisit slide 12 at the end of More functions
    1. map_int(c(100, 53), what_do_i_return) is not a good example of error generation because two errors are genererated at once: 100 returns “100”, and 53 returns 53 which is a double, not an int
  2. Functionals (purrr::map)

3. In-class exercise

  1. Lab #10: functionals - themap() family of functions (not graded)

4. For next time


Lec 13: Thu 10/20

1. Announcements

  1. Schedule shift
    1. Today: Functionals More on functions.
    2. Because of this quiz on functionals is pushed to next week, Friday 10/28 5pm

2. Chalk talk/lecture

  1. More functions

3. In-class exercise

  1. Lab #9: More functions: the dots (not graded)

4. For next time


Lec 12: Tue 10/18

1. Announcements

  1. “Show, don’t tell” what the README should look like for your Project Phase I.
  2. For future reference: How to create future README.Rmd files that generate the README.md cover page of your repos:
    1. RStudio Menu -> File -> New File -> R Markdown…
    2. From Template
    3. GitHub Document (Markdown)
    4. Name: README
    5. Location: In root directory of your repo

2. Chalk talk/lecture

  1. None

3. In-class exercise

  1. Work on Project Phase I assigned, due Tue 10/18 6pm on GitHub Classroom.
  2. If you’re done, start thinking about your MVP package for Project Phase II
    1. See other past packages; I just added packages from Spring 2022
    2. Take a look at GitHub repos for your favorite packages. You understand the basic structure!

4. For next time


Lec 11: Thu 10/13

1. Announcements

  1. Post on Slack #general about GEM program for American Indian/Native, African American/Black, or Hispanic American/Latino students interested in graduate studies in STEM

2. Chalk talk/lecture

  1. Recap of Lec10 -> Lab #7 R Packages -> Exercise 13 which I added after class.
  2. Functions

3. In-class exercise

  1. Lab #8: Functions (not graded)

4. For next time

  1. Quiz on functions due tomorrow 5pm
  2. Project Phase I assigned, due Tue 10/18 6pm on GitHub Classroom. Classtime will be given on Tuesday to work on this.

Lec 10: Thu 10/6

1. Announcements

  1. Install/update devtools and roxygen2 packages
  2. For the rest of the semester, please sit next to your groupmates, package work starts today!
  3. Importance of boundaries
  4. Because we haven’t covered functions yet, we need to push back “Project Phase I: Basics” due date from Thu 10/13 to Tue 10/18

2. Chalk talk/lecture

  1. Second attempt of Lec 02 on Thu 9/8: A MVP of an R package

3. In-class exercise

  1. Lab #7: R Packages (not graded)

4. For next time


Lec 09: Tue 10/4

1. Announcements

  1. Discuss Project Phase I: Basics due on Thu 10/13
  2. Install/update and load reprex R package
  3. Please ensure you’ve marked completion of all GitHub exercises in the students Google Sheet; all incomplete ones are marked in yellow.

2. Chalk talk/lecture

Reproducible examples

  1. Show don’t tell your error: Create a REPRoducible EXamples called a reprex
    1. Getting help with the following code.
    library(ggplot2)
    library(dplyr)
    library(lobstr)
    
    # How do I compute average price?
    diamonds %>% 
      summarize(avg_price = avg(price))
    1. Load library(reprex)
    2. Copy the parts of above code and run reprex()
    3. Copy the output
  2. In-line code and code blocks in Slack messages
    1. Paste code
  3. reprex’s should be minimal (eliminate non-relevant code) and need to be viable (code should reproduce the error).
  4. Basic curtesy and programmer etiquette
  5. IRL: filing a GitHub issue for the source code for the ggplot2 package

Merge conflicts

This occurs when there are unsynced changes of the same line of code. In your project groups:

  1. Clone a local copy of this repo
  2. Go to your group’s folder and open README
  3. Designate group member X, Y, and if applicable Z
  4. Replace X, Y, or Z with your name
  5. Only Member X: commit and push your change first.
  6. After this, Member Y attempt to commit and push change. Note, because Member Y didn’t do a pull first to sync their local repo with the remote, this will result in a merge conflict.
  7. Resolve merge conflict
  8. After this, Member Z attempt to commit and push change to create another merge conflict
  9. Resolve merge conflict

Control flow

  1. Control flow

3. In-class exercise

  1. Lab #6: Control Flow (not graded)

4. For next time

  1. Project Phase I go over:
    1. Submission format
    2. Code review

Lec 08: Thu 9/29

1. Announcements

  1. Be sure to post links to students Google Sheet once you’ve completed all steps of Lec 07 exercises. Recall I posted the screencast of the in-class demo here of the three modes of collaboration:
    1. Self-collaborator
    2. Outside collaborator
    3. Inside collaborator

2. Chalk talk/lecture

  1. Recall Lec 07 “Outside Collaborator” pull request based on your forked copy of sds270 which you could not directly edit.
    1. What does this look like from the perspective of master sds270 owners: Me and Swaha?
    2. Study the anatomy of a pull request page
  2. A real-life example of such pull requests: my contribution to gapminder R package
    1. If you haven’t already, install gapminder package
    2. Bring up the help file for ?gapminder
    3. Scroll down to info on gdpPercap variable
  3. S3 object oriented programming

3. In-class exercise

  1. Lab #5: S3 object oriented programming (not graded)

4. For next time

  1. Quiz due tomorrow at 5pm
  2. Discuss Project Phase I: Basics due on Thu 10/13
  3. Git merge conflicts: When two or more people edit and commit the same line of code.

Lec 07: Tue 9/27

1. Announcements

  1. Sit with your group members today
  2. Note sds270 GitHub Organization for this course: Click on GitHub icon on top right of webpage
  3. Note when office hours and Spinelli hours are: GitHub is a skill, it takes practice
  4. Accountability reminder from Lec 06
    1. Accept emails invitations from GitHub
    2. Verify that you see yourself in this roster
    3. Indicate that you can see yourself the roster in Students Sheet

2. Chalk talk/lecture

In-class demo on how to applying GitHub terminology and terminology we covered in Lec 06. Going forward, I will post screencasts of all in-class demos here.

  1. Self-Collaborator (starts at 00:00): Making changes to a repo that only you can edit
    1. Create a new repo on GitHub.com called test270. Set: description as “testing”, Public, add a README file, add a .gitignore file using the R template, no license.
    2. (Using RStudio interface to GitHub) Clone the remote version of this repo locally. i.e. download a copy onto your computer
    3. Add your name to README.md
    4. Commit and push changes. i.e. stage your changes and then upload your changes
    5. Verify your changes on GitHub.com
    6. Post a github.com link to your remote copy of repo in students Google Sheet
  2. Outside collaborator (starts at 5:42): Makes changes to a repo you can’t directly edit i.e. you are an external collaborator. Do this using a forked branch.
    1. (Delete all previous versions of git-demo repos on your computer to avoid confusion)
    2. Go to git-demo remote repo that only Swaha and I can edit
    3. Fork this repo i.e. make a copy of this repo to your own GitHub account.
    4. Clone the remote version of this forked repo locally
    5. Add your name to README.md as a markdown hyperlink to your GitHub profile page
    6. Commit and push changes
    7. Verify your changes on GitHub.com in your forked copy of repo, not master repo
    8. Create a “pull request” to original git-demo repo. i.e. Send a request to Swaha and me to merge your changes in the master copy.
    9. Post a github.com link to your pull request page in students Google Sheet
  3. Inside collaborator (starts at 13:20): Making changes to a repo you and collaborators can directly edit i.e. you are an internal collaborator. Do this using a regular branch.
    1. Go to dusty-carpet repo that you can all edit
    2. Create a new branch using your firstname-lastname i.e. compartmentalize your changes to keep master branch pristine for now
    3. Clone the remote version of this repo locally
    4. Switch to your firstname-lastname branch
    5. In README.md Change flip_switch_1 from 0 to 1 next to your name. Be sure to only edit the single character.
    6. Commit and push changes
    7. Verify your changes on GitHub.com in your firstname-lastname branch, not master branch
    8. Create a “pull request” to master branch. i.e. Create a request to merge your changes in the master branch
    9. Merge your pull request
    10. Delete your firstname-lastname branch
    11. Post a github.com link to your pull request page in students Google Sheet

3. In-class exercise

Complete above three GitHub exercises, posting all links in the students Google Sheet

4. For next time

  1. Complete above three GitHub exercises by Thursday. I have office hours today and tomorrow.
  2. We’ll cover S3 on Thursday

Lec 06: Thu 9/22

1. Announcements

  1. Vectors and GitHub quiz on Moodle due Friday 5pm
  2. All groups with two members: If you’re willing to take a 3rd member, please let me know in the group Slack DM
  3. Join sds270 GitHub organization for this class:
    1. Accept email invitation (check spam folder as well)
    2. Verify that you see yourself in this roster
    3. Confirm you can see yourself in Students Sheet

2. Chalk talk/lecture

  1. GitHub theory behind the GitHub Lab from Lec 05, using this example
  2. Subsetting

3. In-class exercise

  1. Lab #4: Subsetting (not graded)

4. For next time

  1. We’re going to revisit the GitHub Lab from Lec 05, after I’ve had time to fiddle with sds270 GitHub organization

Lec 05: Tue 9/20

1. Announcements

  1. Slack #questions
  2. Install/update the usethis package
  3. In this Google Sheet:
    1. Add your GitHub login
    2. Indicate whether you were able to install git and clone the repo from Lec 04

2. Chalk talk/lecture

  1. GitHub
  2. From today’s reading: creating personal access tokens using the usethis package

3. In-class exercise

  1. Lab #3: GitHub (not graded)
  2. Readings from:
    1. Read Happy Git and GitHub for the UseR as a reference for when you have issues
    2. Read about pull requests using usethis

4. For next time


Lec 04: Thu 9/15

1. Announcements

  1. Reminder: Quiz on Moodle due on Friday 5pm.
  2. Slack everyone subscribe to:
    1. Subscribe to #course-questions - Ask questions about class here: syllabus questions, due dates, etc.
    2. Subscribe to #oh-shit-git - Ask questions about Git/GitHub here
    3. Note I renamed #tech-questions to #r-questions - Ask questions about R here.

2. Chalk talk/lecture

  1. Lesson I learned at Google: Don’t thrash
  2. Vectors

3. In-class exercise

  1. Please let me know your groups of 2-3 students
  2. Lab #2: Names and values (not graded)
  3. Readings from Advanced R:
    1. Ch 3: Vectors (note I assigned this reading on Tuesday, but am splitting it to match lecture and lab)

4. For next time

Before Tuesday, do your best to “clone” this GitHub repository locally i.e. download a copy of this repo onto your computer:

  1. Watch this screencast from my SDS 410 Capstone course. Note they are cloning a different repo.
  2. Fill out your Git status in this Google Sheet. No shame in our game folks “oh shit git” moments are normal, even comical.
  3. If you get stuck, don’t thrash. Instead:
    1. Take a deep breath
    2. Go to #oh-shit-git slack channel and see if anyone has encountered the same issue
      1. If yes, reply in the thread “I have the same issue”
      2. If no, post a screenshot of where you are stuck (i.e. show don’t tell)

Lec 03: Tue 9/13

1. Announcements

  1. Smashing the course reset button
  2. Office hours posted in Syllabus

2. Chalk talk/lecture

  1. Names and Values

3. In-class exercise

  1. If you haven’t already, fill out Intro Survey Google Form
  2. Please let me know your groups of 2-3 students
  3. Lab #1: Names and values (not graded)
  4. Readings from Advanced R:
    1. Ch 1: Intro
    2. Ch 2: Names and values
  5. Before Friday 5pm, quiz on Moodle on Ch 2: Names and values

4. For next time

  1. I’ll have full readings, quiz, and due date schedule

Lec 02: Thu 9/8

1. Chalk Talk

  1. How do I know if my git install was successful?
    1. Terminal window
  2. Executive summary of today’s readings from R packages
    1. Chapter 1
    2. Chapter 2 up to and including 2.15

2. In-class exercise

  1. If you haven’t already, complete all steps from Lec 01 in-class exercise
  2. Do today’s readings, running all code you encounter. You must finish the reading before next lecture.
  3. Start thinking of making groups of 2-3 students for your projects. Once you have a group, think of a group name and come see me.

Lec 01: Tue 9/6

1. Chalk Talk

  1. If you’re on waitlist fill out this Google Form
  2. Introductions
  3. Go over webpage and syllabus
  4. Practice using Slack threads in #questions (so we’re organized)
  5. Mantras for semester:
    1. MVP
    2. DD
    3. Point of diminishing returns

2. In-class exercise

  1. Complete intro survey (so I can get to know you)
  2. Update R to latest version 4.2.1 (so there is no confusion due to versions)
  3. Update RStudio (so you can use Quarto)
  4. Update all packages, staring with tidyverse (so there is no confusion due to versions)
  5. If you haven’t already
    • Create an account on GitHub.com using your Smith email address (so you can get student benefits)
    • Install Git on your computers (so you can collaborate)
    • In RStudio -> Terminal panel: type git. If you don’t get an error message, you’re good.