The raw data behind the story "What America’s Governors Are Talking About" https://fivethirtyeight.com/features/what-americas-governors-are-talking-about/
state_index state_words
2 data frames about the 50 U.S Governors' Speeches
An object of class spec_tbl_df
(inherits from tbl_df
, tbl
, data.frame
) with 2223 rows and 9 columns.
The chi-square test statistic https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.chi2.html#sklearn.feature_selection.chi2
A data frame with 50 rows representing the 50 U.S. states and 5 variables:
the state
the name of the state's governor
the party of the state's governor
the filename of the speech in the speeches folder at https://github.com/rudeboybert/fivethirtyeight/tree/master/data-raw/state-of-the-state/speeches
a link to an official/media source for the speech
A data frame with 2,223 rows representing phrases and 9 variables:
one-, two-, and three-word phrases spoken repeatedly
thematic categories for the phrases
number of Democratic speeches containing the phrase
number of Republican speeches containing the phrase
total number of speeches containing the phrase
percent of the 23 Democratic speeches containing the phrase
percent of the 27 Republican speeches containing the phrase
the chi-square test statistic for statistical significance
p-value for chi^2 test