`wefe`.MulticlassHardDebias¶

class wefe.MulticlassHardDebias(pca_args: Dict[str, Any] = {'n_components': 10}, verbose: bool = False, criterion_name: Optional[str] = None)[source]¶

Generalized version of Hard Debias that enables multiclass debiasing.

Generalized refers to the fact that this method extends Hard Debias in order to support more than two types of social target sets within the definitional set. For example, for the case of religion bias, it supports a debias using words associated with Christianity, Islam and Judaism.

References

[1]: Manzini, T., Chong, L. Y., Black, A. W., & Tsvetkov, Y. (2019, June).
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass
Bias in Word Embeddings.
In Proceedings of the 2019 Conference of the North American Chapter of the
Association for Computational Linguistics: Human Language Technologies,
Volume 1 (Long and Short Papers) (pp. 615-621).
[2]: https://github.com/TManzini/DebiasMulticlassWordEmbedding

__init__(pca_args: Dict[str, Any] = {'n_components': 10}, verbose: bool = False, criterion_name: Optional[str] = None) → None[source]¶

Initialize a Multiclass Hard Debias instance.

Parameters

pca_argsDict[str, Any], optional: Arguments for the PCA that is calculated internally in the identification of the bias subspace, by default {“n_components”: 10}
verbosebool, optional: True will print informative messages about the debiasing process, by default False.
criterion_nameOptional[str], optional: The name of the criterion for which the debias is being executed, e.g. ‘Gender’. This will indicate the name of the model returning transform, by default None

fit(model: wefe.word_embedding_model.WordEmbeddingModel, definitional_sets: Sequence[Sequence[str]], equalize_sets: Sequence[Sequence[str]]) → wefe.debias.base_debias.BaseDebias[source]¶

Compute the bias direction and obtains the equalize embedding pairs.

Parameters

modelWordEmbeddingModel: The word embedding model to debias.
definitional_setsSequence[Sequence[str]]: A sequence of string pairs that will be used to define the bias direction. For example, for the case of gender debias, this list could be [[‘woman’, ‘man’], [‘girl’, ‘boy’], [‘she’, ‘he’], [‘mother’, ‘father’], …].
equalize_pairsOptional[Sequence[Sequence[str]]], optional: A list with pairs of strings, which will be equalized. In the case of passing None, the equalization will be done over the word pairs passed in definitional_sets, by default None.

Returns

BaseDebias: The debias method fitted.

transform(model: wefe.word_embedding_model.WordEmbeddingModel, target: Optional[List[str]] = None, ignore: Optional[List[str]] = None, copy: bool = True) → wefe.word_embedding_model.WordEmbeddingModel[source]¶

Execute Multiclass Hard Debias over the provided model.

Parameters

modelWordEmbeddingModel: The word embedding model to debias.
targetOptional[List[str]], optional: If a set of words is specified in target, the debias method will be performed only on the word embeddings of this set. If None is provided, the debias will be performed on all words (except those specified in ignore). by default None.
ignoreOptional[List[str]], optional: If target is None and a set of words is specified in ignore, the debias method will perform the debias in all words except those specified in this set, by default None.
copybool, optional: If True, the debias will be performed on a copy of the model. If False, the debias will be applied on the same model delivered, causing its vectors to mutate. WARNING: Setting copy with True requires RAM at least 2x of the size of the model, otherwise the execution of the debias may give rise to MemoryError, by default True.

Returns

WordEmbeddingModel: The debiased embedding model.

wefe.MulticlassHardDebias¶

`wefe`.MulticlassHardDebias¶