longformer

Master's thesis with code investigating methods for incorporating long-context reasoning in low-resource languages without pre-training from scratch. We investigated whether multilingual models could inherit these properties by converting them into an Efficient Transformer (such as the Longformer architecture).

multilingual language-models xlm roberta huggingface multilingual-models longformer huggingface-transformers longformer-models efficient-transformers

Updated Aug 19, 2021
Jupyter Notebook

wjunneng/2020-AI-Financial-User-Review-Categories

Star 14

2020 AI Yanxishe (AI研习社) financial user review classification

text-classification bert longformer

Updated May 17, 2020
Python

LennartKeller/roberta2longformer

Star 11

Convert pretrained RoBERTa models to various long-document transformer models

nlp transformers pytorch language-model roberta huggingface longformer huggingface-transformers roberta-model longformer-models big-bird nystromformer

Updated Apr 5, 2022
Python

nsi319/Legal-Summarizer

Star 10

Longformer Encoder-Decoder model for the legal domain, trained for long-document abstractive summarization.

legal summarization seq2seq transfer-learning encoder-decoder abstractive-summarization huggingface longformer

Updated Feb 26, 2021

kbulutozler/transformers-text-classification

Star 10

Using transformers for text classification.

nlp text-classification transformers bert huggingface longformer

Updated Nov 10, 2021
Jupyter Notebook

dmamakas2000/ipo

Star 6

This GitHub repository implements a novel approach for detecting Initial Public Offering (IPO) underpricing using pre-trained Transformers. The models, extended to handle large S-1 filings, leverage both textual information and financial indicators, outperforming traditional machine learning methods.

python nlp finance ai deep-learning pytorch shares bert finance-application capital ipo longformer initial-public-offering llms

Updated Dec 2, 2024
Python

sidharrth2002/text-scoring

Star 5

Industrial Text Scoring using Multimodal Deep Natural Language Processing 🚀 | Code for IEA AIE 2022 paper

nlp transformers longformer

Updated Jan 1, 2023
Python

KimJaehee0725/YoYAK

Star 5

[13th Tobigs (투빅스) Conference] YoYAK - Yes or Yes, Attention with gap-sentence for Korean long sequence

nlp transformers pytorch bart summarization pegasus korean-nlp longformer long-sequence

Updated Apr 4, 2022
Jupyter Notebook

karthiikJR/video-pdf-summarization

Star 4

A summarization website that can generate summaries from either YouTube videos or PDF files.

python flask reactjs transformers summarization tailwindcss longformer

Updated Sep 25, 2024
Python

Bakhitovd/led-base-7168-ml

Star 4

Fine-tuned Longformer for Summarization of Machine Learning Articles

summarization fine-tuning scientific-papers longformer

Updated Jun 22, 2023
Jupyter Notebook

akuritsyn/feedback-prize-2021

Star 4

Kaggle NLP competition - Top 2% solution (36/2060)

nlp transformers pytorch longformer deberta

Updated Mar 17, 2022
Jupyter Notebook

Attention-is-All-We-Need/Document-Summary-Generator

Star 4

nlp flask youtube-api requests web-scraping summarization cosine-similarity html-css-javascript bert hackathon-project beautifulsoup4 hnswlib longformer generative-ai

Updated Aug 30, 2024
Python

OthmanMohammad/Longformer-Learning-Next-Generation-Sentiment-Analysis

Star 4

This project applies the Longformer model to sentiment analysis on the IMDB movie review dataset. The Longformer model, introduced in "Longformer: The Long-Document Transformer," handles long documents by combining sliding-window attention with global attention. The implementation uses PyTorch and follows the paper's architecture.

nlp transformers pytorch longformer

Updated Apr 7, 2023
Python
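
The sliding-window and global attention pattern mentioned in the description above can be illustrated with a small mask-building sketch. This is not code from the listed repository: the function name `longformer_attention_mask` and its parameters are hypothetical, and the snippet only shows which token pairs may attend to each other, not an actual attention computation.

```python
def longformer_attention_mask(seq_len, window, global_positions=()):
    """Build a seq_len x seq_len boolean matrix (hypothetical helper).

    mask[i][j] is True when token i may attend to token j.
    `window` is the total local window size, so each token sees
    window // 2 neighbours on either side plus itself.
    """
    if window < 0:
        raise ValueError("window must be non-negative")
    half = window // 2

    # Local sliding-window attention: a band around the diagonal.
    mask = [
        [abs(i - j) <= half for j in range(seq_len)]
        for i in range(seq_len)
    ]

    # Global attention: selected tokens attend to every position,
    # and every position attends to them (symmetric).
    for g in set(global_positions):
        for k in range(seq_len):
            mask[g][k] = True
            mask[k][g] = True
    return mask


# Example: 8 tokens, window of 2, with token 0 (e.g. a [CLS]-style
# token) given global attention.
mask = longformer_attention_mask(seq_len=8, window=2, global_positions=[0])
allowed_pairs = sum(sum(row) for row in mask)
```

With a fixed window, the number of allowed pairs grows roughly linearly in sequence length rather than quadratically, which is the motivation for this pattern on long documents.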

Vladislavlhp7/lay_summarisation

Star 3

Project as part of COMP34812: Natural Language Understanding

transformer summarization gpt-2 longformer

Updated May 5, 2023
Jupyter Notebook

AbineshSivakumar/HyperPartisan_Classification_Using_BERT

Star 1

A hyperpartisan news article classification system using BERT-based techniques. The goal was to leverage state-of-the-art transformer models such as BERT, RoBERTa, and Longformer to accurately classify news articles as hyperpartisan or non-hyperpartisan.

text-classification bert hyperpartisan-news-detection longformer large-language-models

Updated Oct 24, 2023
Jupyter Notebook

hperer02/PII-data-detection

Star 1

This project was developed for a Kaggle competition focused on detecting Personally Identifiable Information (PII) in student writing. The primary objective was to build a robust model capable of identifying PII with high recall. The DeBERTa v3 transformer model was chosen for this task after comparing its performance with other transformer models.

natural-language-processing spacy-nlp pii-detection name-entity-recognition longformer huggingface-transformers roberta-model deberta-v3-large