corpus-linguistics
Here are 343 public repositories matching this topic...
Language:All
Sort:Most stars
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
- Updated
Mar 27, 2025 - Python
My book list
- Updated
Jul 7, 2024
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
- Updated
Feb 17, 2023
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
- Updated
Oct 30, 2024
A list of Indonesian NLP resources.
- Updated
Jan 18, 2022
A curated list of NLP resources for Hungarian
- Updated
Apr 12, 2025
A web-based engine for creating and annotating textual corpora
- Updated
Aug 26, 2023 - PHP
data resource untuk NLP bahasa indonesia
- Updated
Sep 19, 2020
Crawler for linguistic corpora
- Updated
Dec 5, 2023 - Python
🕷️ The pipeline for the OSCAR corpus
- Updated
Dec 18, 2023 - Rust
Kanji usage frequency data collected from various sources
- Updated
Apr 6, 2025 - Astro
Data for the quantitative study of (Vedic) Sanskrit
- Updated
Apr 12, 2025 - Python
Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLP
- Updated
Apr 9, 2024 - Jupyter Notebook
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
- Updated
Apr 21, 2021 - Go
Large silver standart Russian corpus with NER, morphology and syntax markup
- Updated
Jul 24, 2023 - Python
An advanced, extensible web front-end for the Manatee-open corpus search engine
- Updated
Apr 29, 2025 - TypeScript
A textual corpus database for the digital humanities.
- Updated
Jul 26, 2020 - Jupyter Notebook
SpeCT - Speech Corpus Toolkit for Praat. Documentation:https://lennes.github.io/spect/
- Updated
Aug 11, 2023 - HTML
A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。
- Updated
Nov 20, 2021 - Jupyter Notebook
My solutions to selected exercises to "Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit" by Steven Bird, Ewan Klein, and Edward Loper.
- Updated
Dec 5, 2019 - Jupyter Notebook
Improve this page
Add a description, image, and links to thecorpus-linguistics topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thecorpus-linguistics topic, visit your repo's landing page and select "manage topics."