quanteda is in development and will remain so until we declare a 1.0 version, at which time we will only add new functions, not change the names of existing ones. In the meantime, we suggest:
passing arguments by name, e.g. clean(mytexts, removeDigits=TRUE) instead of clean(mytexts, TRUE)
Define a full set of operators for dfmSparse and dfmDense. Right now, only + is defined.
Common import syntax for dictionary imports.
Debug, make robust, and add functionality to textfile().
settings
resample
index
textmodel
Integrate C++ versions of tokenize and clean.
Wordfish C++.