Esquite

Framework for parallel corpora

What is Esquite?

Esquite is a framework for people who have a parallel corpus (bilingual texts) and want a web system that lets them upload documents, manage them, and run queries on words and phrases in both languages.

Features

  • Perform advanced queries in your parallel corpus thanks to the search engine Elasticsearch
  • Manage your documents through the corpus administrator
  • Customize the web client
    • Colors
    • Keyboard with special characters (useful for non-English languages)
    • Custom HTML information in the views: help, about corpus, links, etc.
  • New features in development

Main screens of the web framework. You can search by language (on the left) and download the results as .csv (on the right).
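
As a rough illustration of the search-and-export workflow described above, the sketch below builds an Elasticsearch phrase query body and serializes hits to CSV. It does not reflect Esquite's actual internals: the field names (`spanish`, `otomi`, `document`), the helper functions, and the sample hit are all hypothetical, and no Elasticsearch server is contacted. Only the `match_phrase` and `highlight` keys come from the Elasticsearch query DSL.

```python
import csv
import io
import json


def build_phrase_query(field, phrase, size=10):
    """Return an Elasticsearch query DSL body matching an exact phrase in one field."""
    return {
        "size": size,
        "query": {"match_phrase": {field: phrase}},
        "highlight": {"fields": {field: {}}},
    }


def hits_to_csv(hits, columns):
    """Serialize the `_source` of each hit to CSV text using the given columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, extrasaction="ignore")
    writer.writeheader()
    for hit in hits:
        writer.writerow(hit["_source"])
    return buffer.getvalue()


# Hypothetical field names; the index mapping of a real instance may differ.
body = build_phrase_query("spanish", "buenos días")

# Hand-written stand-in for the "hits" list of a search response.
sample_hits = [
    {
        "_source": {
            "spanish": "buenos días",
            "otomi": "<aligned sentence>",
            "document": "example.csv",
        }
    },
]

print(json.dumps(body, ensure_ascii=False, indent=2))
print(hits_to_csv(sample_hits, ["spanish", "otomi", "document"]))
```

In a real deployment the query body would be sent to the Elasticsearch index behind the instance, and the hits would come from its response rather than from a hand-written list.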

Instances

  • Tsunkua: Otomí-Spanish parallel corpus
  • Kolo: Mixteco-Spanish parallel corpus
  • Axolotl: Náhuatl-Spanish corpus (Old version)

Docker image alternative: Esquite-Docker

Esquite can also be deployed more easily using our official Docker image.

Detailed documentation is available on:

Contact

Are you a speaker or researcher of a minority language who would like to upload a parallel corpus? Contact us: contacto at elotl.mx

Collaborators

  • Collaborator: Xim (@XimGutierrez) - xim at unam.mx
  • Maintainer: Diego B. (@umoqnier) - dbarriga at ciencias.unam.mx
  • DevOps: Javier (@jusafing) - jusafing at jusanet.org

Community