lrec2026-llm-as-annotator-tutorial

LREC-COLING 2026 tutorial notebooks

Colab-ready notebooks for the tutorial Under-Resourced Studies of Under-Resourced Languages: Practical, Reproducible LLM-as-Annotator Pipelines Across Scripts and Domains.

Run order:

  1. 00_setup_and_data.ipynb
  2. 01_prompting_zero_few_shot.ipynb
  3. 02_structured_outputs_and_validation.ipynb
  4. 03_evaluation_and_error_analysis.ipynb
  5. 04_sampling_and_bootstrapping.ipynb

The notebooks default to USE_API = False; they therefore run without API keys using deterministic fallback predictions. Replace the toy data with validated project data before using the pipeline for research.