A ClowdFlows package containing widgets for stream mining. The package can also be used with ClowdFlows 2.0.
Currently, the project contains components for corpus operations and for basic natural language processing operations such as tokenization, stop word removal, lemmatization, and part-of-speech tagging. It also has modules for tweet streaming, term extraction, and gender classification.
pip install cf_streaming
Please find other installation instructions, examples and API reference on Read the Docs.
Please note that this is a research project and that drastic changes can be, and are, made fairly regularly. Changes are documented in the CHANGELOG.
Pull requests and issues are welcome.
Janez Kranjc (@janezkranjc), Anže Vavpetič (@anzev)
- Knowledge Technologies Department, Jožef Stefan Institute, Ljubljana