Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

LanguageMachines

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
@LanguageMachines

Language Machines

NLP Research group at Centre for Language Studies, Radboud University Nijmegen

Popular repositoriesLoading

  1. frogfrogPublic

    Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

    C++ 75 11

  2. uctouctoPublic

    Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use…

    C++ 68 14

  3. timbltimblPublic

    TiMBL implements several memory-based learning algorithms.

    C++ 51 9

  4. PICCLPICCLPublic

    A set of workflows for corpus building through OCR, post-correction and normalisation

    Python 48 7

  5. LuigiNLPLuigiNLPPublic

    A workflow system for Natural Language Processing.

    Python 21 5

  6. libfolialibfoliaPublic

    FoLiA library for C++

    C++ 16 6

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 54 repositories
  • libfolia Public

    FoLiA library for C++

    LanguageMachines/libfolia’s past year of commit activity
    C++ 16GPL-3.0 6 5 0 UpdatedFeb 28, 2025
  • ucto Public

    Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …

    LanguageMachines/ucto’s past year of commit activity
    C++ 68GPL-3.0 14 12 0 UpdatedFeb 8, 2025
  • ticcutils Public

    Ticcutils, a generic utility library shared by our software.

    LanguageMachines/ticcutils’s past year of commit activity
    C++ 7GPL-3.0 9 1 0 UpdatedJan 24, 2025
  • wopr Public

    Memory Based Word Predictor/Language Modelhttp://ilk.uvt.nl/wopr/

    LanguageMachines/wopr’s past year of commit activity
    C++ 50 1 0 UpdatedDec 24, 2024
  • toad Public

    Toad: Trainer Of All Data, the Frog training collection

    LanguageMachines/toad’s past year of commit activity
    C++ 1GPL-3.0 2 1 0 UpdatedDec 23, 2024
  • foliatest Public

    Test suite for libfolia

    LanguageMachines/foliatest’s past year of commit activity
    C++0GPL-3.0 2 0 0 UpdatedDec 20, 2024
  • foliautils Public

    Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)

    LanguageMachines/foliautils’s past year of commit activity
    C++ 4GPL-3.0 3 8 0 UpdatedDec 16, 2024
  • timbl Public

    TiMBL implements several memory-based learning algorithms.

    LanguageMachines/timbl’s past year of commit activity
    C++ 51GPL-3.0 9 1 0 UpdatedDec 16, 2024
  • ticcltools Public

    Tools for TICCL

    LanguageMachines/ticcltools’s past year of commit activity
    C++ 14GPL-3.0 4 17 0 UpdatedDec 16, 2024
  • dimbl Public

    Distributed Tilburg Memory Based Learner

    LanguageMachines/dimbl’s past year of commit activity
    C++ 2GPL-3.0 2 0 0 UpdatedDec 16, 2024

Top languages

Loading…

Most used topics

Loading…


[8]ページ先頭

©2009-2025 Movatter.jp