User Tools

Site Tools


02_browsing:04_queries

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
02_browsing:04_queries [2020/01/06 17:01] simone02_browsing:04_queries [2021/11/29 15:28] – external edit 127.0.0.1
Line 1: Line 1:
 ====== 2.4 Queries ====== ====== 2.4 Queries ======
-You can query the corpus for very different pieces of information such as texts written in the chats, part of speech annotations, demographic information like the age of the informant, or statistical information like the number of messages in an chat.+You can query the corpus for very different pieces of information such as messages written in the chats, part of speech annotations, demographic information like the age of the informant, or statistical information like the number of messages in chat.
  
-Please keep in mind that all the fields in the corpus are text fields. For your query that means that you cannot search for //larger// or //smaller than//. For example, you cannot say "show me all chats with more than 1000 messages", because this field is interpreted as text and not as a digit. In theory, you would have to query for messages with 1000, 1001, 1002, 1003 etc. messages. In practice, this is not a very useful chriterium for a query.+Please keep in mind that all the fields in the corpus are text fields. 
  
 The following three options for querying the corpus are described in more detail in the sub-sections of this document: The following three options for querying the corpus are described in more detail in the sub-sections of this document:
-  * [[02_browsing:04_queries:01_simple_queries|Simple queries]]: these are basically queries for words e.g. //est// or //ich// etc. +  * [[02_browsing:04_queries:02_simple_queries|Simple queries]]: These are basically queries for words e.g. //est// or //ich// etc. 
-  * [[02_browsing:04_queries:02_regex|RegEx queries]]: are used for more complex patterns such as alternatives (//man// and //men//), for patterns with different endings (//Man// and //Manchester//) etc.+  * [[02_browsing:04_queries:03_regex|RegEx queries]]: These are used for more complex patterns such as alternatives (//man// and //men//), for patterns with different endings (//Man// and //Manchester//) etc.
   * Queries for meta data   * Queries for meta data
-  * Combined queries: are used whenever you want information from different [[02_browsing:02_layers|layers]], e.g. the word //man// written by only females.+  * Combined queries: These are used whenever you want information from different [[02_browsing:02_layers|layers]], e.g. the word //man// written by only females.
  
-Please remember to always keep in mind the unit that you are querying. If you query in individual tokens, you do not have to consider separators such as spaces, punctuation, tabs etc. If, on the other hand, you work on a whole message, you have to take such things into account. You also have to remember that querying over whole messages is very slow and can end in errors when the server is overloaded.+Please remember to always keep in mind the unit that you are querying. If you query in individual tokens, you do not have to consider separators such as spaces, punctuation, tabs etc. If, on the other hand, you work on a whole message, you have to take such things into account. You also have to remember that querying over whole messages is very slow and can lead to time outs.
  
02_browsing/04_queries.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki