ridgeplot.datasets.load_probly

ridgeplot.datasets.load_probly(version='zonination')

Load a version of the “Perception of Probability Words” (a.k.a., “probly”) dataset.

Parameters: version – The version of the dataset to load. Valid options are "zonination" (default), "wadefagen", and "illinois". The versions differ slightly because each originates from a different survey. See the Notes section for more details.
Returns: A dataframe containing the requested version of the probly dataset.
Return type: pandas.DataFrame
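
Examples

A minimal sketch of how the three versions might be compared. The helper `dataset_shapes` is hypothetical and not part of ridgeplot. It only relies on the documented `version` keyword and on the returned object exposing a pandas-style `.shape`. The live call at the end assumes ridgeplot is installed and is skipped otherwise.

```python
# Versions listed in the Parameters section above.
VERSIONS = ("zonination", "wadefagen", "illinois")


def dataset_shapes(loader, versions=VERSIONS):
    """Return {version: (n_rows, n_columns)} for each requested version.

    `loader` is any callable that accepts a `version` keyword and returns
    an object with a `.shape` attribute (e.g. a pandas.DataFrame).
    This helper is illustrative only; it is not part of ridgeplot.
    """
    shapes = {}
    for version in versions:
        df = loader(version=version)
        shapes[version] = tuple(df.shape)
    return shapes


try:
    from ridgeplot.datasets import load_probly
except ImportError:
    load_probly = None  # ridgeplot is not installed; skip the live demo

if load_probly is not None:
    # Print the dimensions of each dataset version.
    for version, shape in dataset_shapes(load_probly).items():
        print(version, shape)
```

Passing the loader as an argument keeps the helper independent of ridgeplot, so it can be exercised with a stand-in loader when the package is unavailable.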

Notes

Sherman Kent, a CIA analyst, first published his work on the perception of probabilistic words in 1964 [1]. This exercise has been repeated several times since then. This function provides three different versions of the dataset, each originating from a different survey. Valid options for the version parameter are:

"zonination"

This is perhaps the most popular version of the dataset and originates from a survey conducted by the Reddit user /u/zonination.

Dataset details:

Creator	zonination
Source	https://raw.githubusercontent.com/zonination/perceptions/51207062aa173777264d3acce0131e1e2456d966/probly.csv
Accessed on	2023-06-24

"wadefagen"

This version of the dataset originates from a blog post by Wade Fagen-Ulmschneider of the University of Illinois [2]. It is based on a survey conducted across several social media platforms.

Dataset details:

Creator	Wade Fagen-Ulmschneider (wadefagen)
Source	https://raw.githubusercontent.com/wadefagen/datasets/7e752937b72edc3126e3dd17e3cd97eb727af8f9/Perception-of-Probability-Words/survey-results.csv
Accessed on	2023-06-24

"illinois"

This version of the dataset originates from a survey, primarily of undergraduate students, conducted at the University of Illinois [3].

Dataset details:

Creator	University of Illinois
Source	https://waf.cs.illinois.edu/discovery/words.csv
Accessed on	2023-06-24

References