2020
Software  Unknown

PyDRO: A Python reimplementation of the Distributional Random Oversampling method for binary text classification

Moreo Fernandez A. D.

Python  Distributional Random Oversampling  Imbalanced Classification  Binary Classification 

This repository is a stand-alone (re)implementation of the Distributional Random Oversampling (DRO) method presented at SIGIR'16. The original implementation was part of the JaTeCs framework for Java. DRO is an oversampling method that counters data imbalance in binary text classification. It generates new random synthetic minority-class documents by exploiting the distributional properties of the terms in the collection. The variability introduced by oversampling is confined to a latent space; the original space is replicated and left untouched.
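The idea described above can be illustrated with a minimal sketch. It is a simplified reading of the abstract, not PyDRO's actual API or the exact procedure from the SIGIR'16 paper: the function name `dro_sketch`, the `n_copies` parameter, the use of raw term counts as weights, and the choice of one latent dimension per training document are all assumptions made for illustration. Each term is described by how its occurrences are distributed over the training documents; each document mixes the distributions of its terms into a probability vector, and the latent part of every copy is drawn from a multinomial over that vector while the original features are copied unchanged.

```python
import numpy as np


def dro_sketch(X, y, n_copies=2, seed=0):
    """Hypothetical illustration of distributional oversampling (not PyDRO's API).

    X: (n_docs, n_terms) matrix of term counts; y: binary labels (1 = minority).
    Returns the original features plus a sampled latent part per row.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n_docs = X.shape[0]

    # Distributional view of terms: each column becomes a distribution
    # over the training documents in which the term occurs.
    col_sums = X.sum(axis=0, keepdims=True)
    term_dist = np.divide(X, col_sums, out=np.zeros_like(X), where=col_sums > 0)

    rows, labels = [], []
    for i in range(n_docs):
        length = int(X[i].sum())
        # Assumption: minority documents are replicated n_copies times,
        # all other documents get a single latent sample.
        reps = n_copies if y[i] == 1 else 1
        for _ in range(reps):
            if length > 0:
                # Mixture of the document's term distributions,
                # weighted by raw term frequency (an assumption).
                p = term_dist @ X[i]
                p = p / p.sum()
                latent = rng.multinomial(length, p)
            else:
                latent = np.zeros(n_docs)
            # The original space is copied untouched; only the latent part varies.
            rows.append(np.concatenate([X[i], latent]))
            labels.append(y[i])
    return np.vstack(rows), np.array(labels)


# Toy collection: 4 documents, 3 terms, the first document is the minority class.
X_toy = np.array([[2, 1, 0], [0, 1, 3], [1, 0, 1], [0, 2, 2]])
y_toy = np.array([1, 0, 0, 0])
X_over, y_over = dro_sketch(X_toy, y_toy, n_copies=3)
print(X_over.shape, y_over)
```

In this sketch the three copies of the minority document share identical original features and differ only in the sampled latent columns, which is the property the description attributes to DRO.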



BibTeX entry
@misc{oai:it.cnr:prodotti:415606,
	title = {PyDRO: A Python reimplementation of the Distributional Random Oversampling method for binary text classification},
	author = {Moreo Fernandez A. D.},
	year = {2020}
}
CNR ExploRA

Bibliographic record

Also available from: github.com