ANNIS, which stands for ANNotation of Information Structure, is an open source search and visualization tool that focuses on multilayer linguistic corpora. It is thus the ideal tool for queries in the part of our corpus that was normalized and PoS annotated. It allows to search the same corpus in multiple layers.
The query syntax used by ANNIS has a steep learning curve as is the case with most tools that offer an abundance of options. But ANNIS also offers an excellent tutorial that is built into the browsing tool. We therefore refrain from writing an additional manual here. Instead, we only show you how to get started and how to find the built in manual. If you do not want to read the manual but just get started, we recommend you to have a look at a few examples that we put together based on our own corpus. This might give you a good idea about how to create your own queries.
Getting startedWhen you start ANNIS, you get a screen similar to this one:![]() ![]() CorporaNon-dialectal German
French
Swiss German dialect
Italian
Romansh
Layers, labels and valuesOur corpus, as it is available in ANNIS, is build up of different layers, which can be queried and (except formeta ) displayed. Here is an example:
Query: abbrev ="" & lang = "eng" & #1.#2
as you can see, the expressions searched for are highlighted.
![]()
pos layer, you can use the following tag sets:
Sample queriesBeni Ruef was as kind as to put together these queries from all our subcorpura. If you have a close look at the syntax, you will learn how to build your own queries by combining the individual elements. Male teenagers writing about school:meta::sex="M" & meta::age=/1[3-9]/ & lemma="Schule"All French nonce borrowings (in German, Romansh or French subcorpus): lang="fra"The three tokens es hat noch in exactly this order: /[Ee]s/ & "hat" & "noch" & #1.#2 & #2.#3All spelling variants of gesagt: gloss="gesagt"Typical Swiss German possessive construction (e.g. em Hans sis Hus ≈ dem Hans sein Haus): pos="ART" & pos=/(NE|NN)/ & pos="PPOSAT" & pos="NN" & #1.#2 & #2.#3 & #3.#4Emphasis using the personal pronoun (1sg) before the verb in Italian: lemma="io" & pos=/VER:.+/ & #1.#2All Swiss German nonce borrowings from Puter speakers: lang="gsw" & meta::lang_main="roh-pt" |
On this page:
You might also be interested in: Please don't forget to quote the corpus in your work. |