ridgeplot.datasets.load_probly¶

ridgeplot.datasets.load_probly(version='zonination')[source]¶

Load a version of the “Perception of Probability Words” (a.k.a., “probly”) dataset.

Parameters:: version ({'zonination', 'wadefagen', 'illinois'}, default: 'zonination') – The version of the dataset to load. Each version is slightly different and originates from different surveys. See the Notes section for more details on each version.
Returns:: A dataframe containing a probly dataset.
Return type:: pandas.DataFrame

Notes

Sherman Kent, a CIA analyst, first published his work on the perception of probabilistic words in 1964 [1]. This exercise has been repeated several times since then. This function provides three different versions of the dataset, each originating from a different survey. Valid options for the version parameter are:

"zonination"

This is perhaps the most popular version of the dataset and originates from a survey conducted by the Reddit user /u/zonination.

Dataset details...

Creator	@zonination
Source	https://raw.githubusercontent.com/zonination/perceptions/51207062aa173777264d3acce0131e1e2456d966/probly.csv
Accessed on	2023-06-24

"wadefagen"

This version of the dataset originates from a blogpost by Wade Fagen-Ulmschneider from the University of Illinois [2]. It is based on a survey conducted on different social media platforms.

Dataset details...

Creator	Wade Fagen-Ulmschneider (@wadefagen)
Source	https://raw.githubusercontent.com/wadefagen/datasets/7e752937b72edc3126e3dd17e3cd97eb727af8f9/Perception-of-Probability-Words/survey-results.csv
Accessed on	2023-06-24

"illinois"

This version of the dataset originates from a survey of primarily undergraduate students conducted at The University of Illinois [3].

Dataset details...

Creator	University of Illinois
Source	https://waf.cs.illinois.edu/discovery/words.csv
Accessed on	2023-06-24

References