ridgeplot.datasets.load_probly

ridgeplot.datasets.load_probly(version='zonination')[source]

Load a version of the “Perception of Probability Words” (a.k.a., “probly”) dataset.

Parameters:

version ({'zonination', 'wadefagen', 'illinois'}, default: 'zonination') – The version of the dataset to load. Each version is slightly different and originates from different surveys. See the Notes section for more details on each version.

Returns:

A dataframe containing a probly dataset.

Return type:

pandas.DataFrame

Notes

Sherman Kent, a CIA analyst, first published his work on the perception of probabilistic words in 1964 [1]. This exercise has been repeated several times since then. This function provides three different versions of the dataset, each originating from a different survey. Valid options for the version parameter are:

"zonination"

This is perhaps the most popular version of the dataset and originates from a survey conducted by the Reddit user /u/zonination.

Dataset details...
"wadefagen"

This version of the dataset originates from a blogpost by Wade Fagen-Ulmschneider from the University of Illinois [2]. It is based on a survey conducted on different social media platforms.

Dataset details...
"illinois"

This version of the dataset originates from a survey of primarily undergraduate students conducted at The University of Illinois [3].

Dataset details...

Creator

University of Illinois

Source

https://waf.cs.illinois.edu/discovery/words.csv

Accessed on

2023-06-24

References