Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Teal deer (from TL;DR) helps you get the gist of all the stuff you need to read, so you don't have to read it all at once.

NotificationsYou must be signed in to change notification settings

DeepLearningSky/teal_deer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

TLDR_LDA_and_Text_Summarization.ipynb is the primary current notebook.

Currently just hacking notebook. However, the notebook scrapes text from a directory of academic research pdf's,and then does LDA on it for prioritization of reading. Dataset for this run included just a handful of paperson chatbots from arxiv. OCR portion relies on:https://github.com/euske/pdfminer/blob/master/tools/pdf2txt.py

In process:
Adding a text summarization feature to try to generate abstracts or short summaries for large blocksof text (i.e., an abstract for the rest of a paper). So, not only could papers be prioritized, but could besummarized as well.

Planned updates - See project tab as well:

  • Finish out OCR from PDF files part
  • Complete the text summarization portion - Thanks to Siraj Raval for making the video:https://www.youtube.com/watch?v=ogrJaOIuBx4
  • Clean up into python scripts with test suites
  • Experiment with other front-end usecases: i.e., a slackbot is currently underway (notebook to be added later).
  • Add a CI framework into this repo.
  • Cartoon for a fun logo :-)

About

Teal deer (from TL;DR) helps you get the gist of all the stuff you need to read, so you don't have to read it all at once.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook100.0%

[8]ページ先頭

©2009-2025 Movatter.jp