What's up, Switzerland?

User Tools

Site Tools


This is an old revision of the document!

2.3 Queries

You can query the corpus for very different pieces of information such as texts written in the chats, part of speech annotations, demographic information like the age of the informant, or statistical information like the number of messages in an chat.

Please keep in mind that all the fields in the corpus are text fields. For your query that means that you cannot search for larger or smaller than. For example, you cannot say "show me all chats with more than 1000 messages", because this field is interpreted as text and not as a digit. In theory, you would have to query for messages with 1000, 1001, 1002, 1003 etc. messages. In practice, this is not a very useful chriterium for a query.

The following three options for querying the corpus are described in more detail in the sub-sections of this document:

  • Simple queries: these are basically queries for words e.g. est or ich etc.
  • RegEx queries: are used for more complex patterns such as alternatives (man and men), for patterns with different endings (Man and Manchester) etc.
  • Combined queries: are used whenever you want information from different layers, e.g. the word man written by only females.
02_browsing/04_queries.1573125295.txt.gz · Last modified: 2021/11/29 15:29 (external edit)