
The video introduces you to the PARSEME Ancient Greek working group and this mini-course on corpus annotation.
The video introduces you to the annotation guidelines and the annotation process. Please read through the annotation guidelines before watching the video: https://parsemefr.lis-lab.fr/parseme-st-guidelines/1.3/index.php?page=home. You can toggle to GRC at the top in order to see classical Greek examples.
This video is a demo of annotating the beginning of Lysias, Speech 1 in the FLAT annotation platform. Please download the text before you start watching the video.
This is a practice exercise for you to annotate Lysias, Speech 1 according to the PARSEME 1.3 and GRC working group (TODO) guidelines. Please use the word document attached to do this. I suggest using different colours for each structure as you would also do on FLAT. If you want to check how you did, please head to https://parseme.grew.fr/?corpus=PARSEME-FR@dev and find the Ancient Greek corpus (once released). You can also look at the PDF showing our pre-release group annotations. You will notice that the Perseus language model used to do the parsing does not work perfectly such that the lemmata drawn out are (perhaps) not what you expect. Furthermore, the document is the pre-release stage of comparing different annotator's work, please bear this in mind.
The video summarises what we have done and offers information about how to get involved in the annotation effort along with the activities of the working group.
The course introduces the audience to the PARSEME 1.3 annotation effort for classical Greek. The annotation effort is interested in verbal multi-word expressions such as to spill the beans, to make a suggestion, and to make do in a large range of languages, primarily spoken languages currently. Verbal multi-word expressions are currently underrepresented in standard dictionaries and grammar books of classical Greek. The course showcases what being an annotator in the initiative looks like, what difficulties exist, and how we overcome them, and what we can do with the annotated data on the research side subsequently. All the links to guidelines and all texts to be worked on are supplied in the course resources. Recommendations for external resources to use are provided. The course is intended not only for those interested to join the initiative but also for those interested in Greek linguistics and specifically verbal multi-word expressions especially coming to this from a natural language processing background. The course is a hands-on tutorial for the corpus annotation rather than a linguistic review of verbal multi-word expressions and requires takers to have a good level of knowledge in classical Greek (ideally Attic dialect), we recommend A-level equivalent or above.