“Parks and Recreations” selected food words

“Parks and Recreations” selected food words#

This notebook visualizes selected food words from the TV show “Parks and Recreations”.

Source code: https://codeberg.org/penguinsfly/tv-mania/src/branch/main/pandr

Motivation#

Why? Cuz the calzones …

Well, some foods are unusually used more than normal in the show, calzones being of them (from Ben), and of course waffles from Leslie.

This manually selects the following words to count from the show transcripts:

calzone
pancake
waffle
pizza
steak
beef
burger
pie

You could already guess that calzone & pizza were chosen because of Ben, pancake & waffle cuz of Leslie and steak & beef cuz of Ron

Obtain data#

The data file parks-and-recreation_scripts.csv was obtained using sf2 that downloaded the scripts from Springfield! Springfield!.

sf2 --show "parks-and-recreation" --format csv

Import modules#

Load scripts & count words#

data_path = 'parks-and-recreation_scripts.csv'

words = [
    'calzone', 'pancake', 'waffle', 'pizza', 
    'steak', 'beef', 'burger', 'pie',
]

Count words from data#

script_df = pd.read_csv(data_path, index_col=None)
script_df = script_df.apply(lambda df: count_words(df, words), axis=1)

script_df

	id_text	show	season	episode	episode_title	script	url	total_count	norm_lino	calzone_series	...	pizza_series	pizza_count	steak_series	steak_count	beef_series	beef_count	burger_series	burger_count	pie_series	pie_count
0	Parks and Recreation s01e01 Episode Script	Parks and Recreation	1	1	CHE01 - Pilot	Hello.\nHi.\nMy name is Leslie Knope, and I wo...	https://www.springfieldspringfield.co.uk/view_...	3250	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[0]	0
1	Parks and Recreation s01e02 Episode Script	Parks and Recreation	1	2	CHE03 - Canvassing	LESLIE Well, one of the funner things that we ...	https://www.springfieldspringfield.co.uk/view_...	3561	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[0]	0
2	Parks and Recreation s01e03 Episode Script	Parks and Recreation	1	3	102 - The Reporter	Okay, now, see, here's a good example of a pla...	https://www.springfieldspringfield.co.uk/view_...	3380	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[0]	0
3	Parks and Recreation s01e04 Episode Script	Parks and Recreation	1	4	CHE04 - Boys' Club	TOM So, we've been called out to this hiking t...	https://www.springfieldspringfield.co.uk/view_...	3337	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[0]	0
4	Parks and Recreation s01e05 Episode Script	Parks and Recreation	1	5	CHE05 - The Banquet	In a town as old as Pawnee, there's a lot of h...	https://www.springfieldspringfield.co.uk/view_...	3530	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[1]	1
...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...
117	Parks and Recreation s07e08 Episode Script	Parks and Recreation	7	8	Ms. Ludgate-Dwyer Goes to Washington	Okay.\nSo, the central question that these sen...	https://www.springfieldspringfield.co.uk/view_...	3915	[0.0]	[0]	...	[0]	0	[0]	0	[0]	0	[0]	0	[0]	0
118	Parks and Recreation s07e09 Episode Script	Parks and Recreation	7	9	Pie-Mary	We need to go over the schedule leading up to ...	https://www.springfieldspringfield.co.uk/view_...	3700	[0.0]	[4]	...	[0]	0	[0]	0	[0]	0	[0]	0	[24]	24
119	Parks and Recreation s07e10 Episode Script	Parks and Recreation	7	10	The Johnny Karate Super Awesome Musical Explos...	It's the Johnny Karate Super Awesome Musical E...	https://www.springfieldspringfield.co.uk/view_...	3121	[0.0]	[0]	...	[1]	1	[0]	0	[0]	0	[0]	0	[0]	0
120	Parks and Recreation s07e11 Episode Script	Parks and Recreation	7	11	Two Funerals	Hey.\nHey, guys.\nListen.\nUh, Leslie's gonna ...	https://www.springfieldspringfield.co.uk/view_...	3531	[0.0]	[0]	...	[2]	2	[0]	0	[0]	0	[0]	0	[0]	0
121	Parks and Recreation s07e12 Episode Script	Parks and Recreation	7	12	One Last Ride, Part 1 & 2	Which brings us to 2005.\nThe Circle Park reno...	https://www.springfieldspringfield.co.uk/view_...	6789	[0.0]	[1]	...	[1]	1	[2]	2	[1]	1	[0]	0	[0]	0

122 rows × 25 columns

Compare data frequency to `rspeer/wordfreq`#

This attemps to establish how uncommon these selected words are

The base frequencies are from using rspeer/wordfreq.

Higher ratios between script_freq and base_freq means such words are unusually higher than expected.

No additional comparisons or corrections are done, so take these numbers with a grain of salt.

word_freq_df

	word	script_freq	base_freq	script_count	ratio	log_ratio	z_stat	p_val
0	calzone	0.000058	1.580000e-07	26	368.541199	2.566486	5.085332	1.834919e-07
1	waffle	0.000123	2.000000e-06	55	61.588904	1.789502	7.296233	1.479677e-13
2	pancake	0.000029	2.240000e-06	13	12.997658	1.113865	3.328200	4.370460e-04
3	burger	0.000096	1.170000e-05	43	8.230996	0.915452	5.761040	4.179868e-09
4	steak	0.000054	9.330000e-06	24	5.761020	0.760499	4.048722	2.574908e-05
5	pizza	0.000150	2.690000e-05	67	5.578177	0.746492	6.718468	9.182227e-12
6	pie	0.000076	1.550000e-05	34	4.912663	0.691317	4.644206	1.706934e-06
7	beef	0.000025	1.950000e-05	11	1.263362	0.101528	0.691396	2.446582e-01

Visualize results#

Counts of words throughout the show#

../_images/d3c7af2f9f394152fa52c9e784ae4fc003de7d76534565de30715b83884c14b2.png

How are these words?#

Remember higher means more unusually high, e.g. calzone was used about 350 times more frequently than expected.

Anyone who has watched the show can atest to this lol.

../_images/49b5bfe45e1024b06224652434d6dd81735b4240d63bc20e913268684b901a81.png

Putting it all together w/ annotations#

I put those 2 figures together in Inkscape with some annotations for the final presentatiton

from IPython.display import Image, display
display(Image(filename='figures/pandr-foods.png'))

../_images/bfdba9802d99ed2d0d74950ba56e1a4f8415a6f1d24b5645dcfeabc8e03611b5.png