wefe.datasets
.fetch_eds
- wefe.datasets.fetch_eds(occupations_year: int = 2015, top_n_race_occupations: int = 10) Dict[str, List[str]] [source]
- Fetch the sets of words used in the experiments of the _Word Embeddings
Quantify 100 Years Of Gender And Ethnic Stereotypes_ work.
This dataset includes the following word sets: - gender: male, female. - ethnicity: asian, black, white. - religion: christianity, judaism and islam. - adjetives: appearence, intelligence, otherization, sensitive.
- Parameters:
- occupations_yearint, optional
The year of the census for the occupations file. Available years: {‘1850’, ‘1860’, ‘1870’, ‘1880’, ‘1900’, ‘1910’, ‘1920’, ‘1930’, ‘1940’, ‘1950’, ‘1960’, ‘1970’, ‘1980’, ‘1990’, ‘2000’, ‘2001’, ‘2002’, ‘2003’, ‘2004’, ‘2005’, ‘2006’, ‘2007’, ‘2008’, ‘2009’, ‘2010’, ‘2011’, ‘2012’, ‘2013’, ‘2014’, ‘2015’} , by default 2015
- top_n_race_occupationsint, optional
The year of the census for the occupations file. The number of occupations by race, by default 10
- Returns:
- dict
A dictionary with the word sets.
References
[1]: Word Embeddings quantify 100 years of gender and ethnic stereotypes.Garg, N., Schiebinger, L., Jurafsky, D., & Zou, J. (2018).Proceedings of the National Academy of Sciences, 115(16), E3635-E3644.