Monolingual text data
Multilingual text data
Multimodal text+image
Multimodal text+speech