text-data

Here are 65 public repositories matching this topic...

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

  • Updated Aug 26, 2021
  • Python

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

  • Updated Apr 14, 2022
  • Python

Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/

  • Updated Feb 5, 2024
  • Python

Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation

  • Updated Aug 31, 2020
  • Python

Cleans Reddit Text Data 📜 🧹

  • Updated Apr 14, 2020
  • Python

Tools to uniformly read in text data including semi-structured transcripts

  • Updated Mar 7, 2023
  • R

A Python library for keyword extraction from any text using the RAKE (Rapid Automatic Keyword Extraction) algorithm.

  • Updated May 3, 2024
  • Python

Question classification for the CogComp QC dataset (http://cogcomp.org/Data/QA/QC/).

  • Updated Nov 10, 2020
  • Python

Visualize large text collections with WebGL

  • Updated Sep 4, 2024
  • JavaScript

Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).

  • Updated Mar 7, 2022
  • Python

Old book pages (with ground truth), formerly used for OCR studies. The set comes in several versions that differ in resolution and binarization. Noised and denoised sets (produced by several methods) will eventually be uploaded.

  • Updated Aug 25, 2017
  • HTML

Scrape EDGAR filings from https://www.sec.gov/

  • Updated Mar 10, 2025
  • Julia

A dataset of 30k+ so-called "self-help" tweets from 100+ authors.

  • Updated Oct 12, 2019
  • Jupyter Notebook

This repository hosts a diverse NLP dataset comprising 1,000 stories spanning 100 genres for comprehensive language understanding tasks.

  • Updated Dec 9, 2023

Code for collecting GOM TV subtitle data

  • Updated Feb 10, 2017
  • R

A Python package implementing the Directed LDA model for targeted extraction of specific topics from text data

  • Updated Jan 12, 2025
  • Python
