- provides regular expression (re) matching methods on a more complex structure than a list of characters (string), namely a sequence of features set (i.e.
list
ofdict
in python jargon); - is free from the information encapsulated in the features and consequently can work with word features, sentences features, calendar event features… Indeed, PyRATA is not only dedicated to process textual data.
- is fun and easy to use to explore data for research study, solve deterministic problems, formulate expert knowledge in a declarative way, prototype quickly models and generate training data for Machine Learning (ML) systems, extract ML features, augment ML models…
Download the sources (under Apache v2 license)