ANNIS, which stands for ANNotation of Information Structure, is an open source search and visualization tool that focuses on multilayer linguistic corpora. It is thus the ideal tool for queries in the part of our corpus that was normalized and PoS annotated. It allows to search the same corpus in multiple layers.
The query syntax used by ANNIS has a steep learning curve as is the case with most tools that offer an abundance of options. But ANNIS also offers an excellent tutorial that is built into the browsing tool. We therefore refrain from writing an additional manual here. Instead, we only show you how to get started and how to find the built in manual. If you do not want to read the manual but just get started, we recommend you to have a look at a few examples that we put together based on our own corpus. This might give you a good idea about how to create your own queries.
Getting startedWhen you start ANNIS, you get a screen similar to this one: Because you are not logged in (1), you you only see the Whatsapp data in field (2). Make sure that you are a registred user and log in (1) with the credential that you find on the starting page. Now, your screen changes slightly: In the bottom left, you can now select the subcorpus, with which you want to work (1). Corpora starting in WUS_ are Whatsapp, lower case names are SMS. In the top left (2), you can type in your query or create one with the tools provided. In the center, you see some examples, until you type in your own query in (2). You can come back to these examples as well as to the tutorial any time by pressing on Tutorial (4).
Swiss German dialect
Layers, labels and valuesOur corpus, as it is available in ANNIS, is build up of different layers, which can be queried and (except for
Sample queriesBeni Ruef was as kind as to put together these queries from all our subcorpura. If you have a close look at the syntax, you will learn how to build your own queries by combining the individual elements. Male teenagers writing about school:
meta::sex="M" & meta::age=/1[3-9]/ & lemma="Schule"All French nonce borrowings (in German, Romansh or French subcorpus):
lang="fra"The three tokens es hat noch in exactly this order:
/[Ee]s/ & "hat" & "noch" & #1.#2 & #2.#3All spelling variants of gesagt:
gloss="gesagt"Typical Swiss German possessive construction (e.g. em Hans sis Hus ≈ dem Hans sein Haus):
pos="ART" & pos=/(NE|NN)/ & pos="PPOSAT" & pos="NN" & #1.#2 & #2.#3 & #3.#4Emphasis using the personal pronoun (1sg) before the verb in Italian:
lemma="io" & pos=/VER:.+/ & #1.#2All Swiss German nonce borrowings from Puter speakers:
lang="gsw" & meta::lang_main="roh-pt"
On this page:
You might also be interested in:
Please don't forget to quote the corpus in your work.