languagemodel
Here are 24 public repositories matching this topic...
Language:All
Sort:Most stars
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
- Updated
Jan 31, 2025 - Python
🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative
- Updated
Aug 3, 2023 - JavaScript
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
- Updated
Jan 21, 2025 - Python
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
- Updated
Mar 7, 2024 - Python
Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).
- Updated
Oct 1, 2024 - Python
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
- Updated
Feb 11, 2025 - Python
高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
- Updated
Jun 1, 2020 - Python
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
- Updated
Dec 3, 2023 - Python
TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis
- Updated
Sep 22, 2024 - Python
The PreTENS shared task hosted at SemEval 2022 aims at focusing on semantic competence with specific attention on the evaluation of language models with respect to the recognition of appropriate taxonomic relations between two nominal arguments (i.e. cases where one is a supercategory of the other, or in extensional terms, one denotes a superset…
- Updated
Feb 5, 2022 - Jupyter Notebook
Code for "Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information Maximization"
- Updated
Oct 4, 2020 - Python
A 78.5% word sense disambiguator based on Transformers and RoBERTa (PyTorch)
- Updated
Jun 9, 2020 - Python
Informal to formal dataset mask MLM
- Updated
Mar 6, 2025 - Python
A project that harnesses the Stanford NLP library to gauge sentiment from provided text via an intuitive graphical interface.
- Updated
Aug 20, 2023 - Java
translatorlab: a machine translation tool that uses artificial intelligence models to provide accurate and fast translations between different languages
- Updated
Feb 26, 2025 - Python
This project scrapes and cleans Shakespeare's public domain texts, trains a character-level LSTM model in PyTorch, and generates fresh, Shakespeare-like text. Perfect for literature and NLP enthusiasts, it provides metrics (loss, perplexity, accuracy) and a platform for tuning hyperparameters and exploring the art of AI-driven language modeling.
- Updated
Dec 19, 2024 - Jupyter Notebook
The PowerShell Random Text Generator is a script that generates random text based on a given model.
- Updated
Jul 7, 2024 - PowerShell
The project generates a sentence given a pre-defined starting phrase from the user such as "Ilbierah kont" and the script attempts to build a sentence off of that phrase. Structurally, the generator works in an n-gram fashion but the main structures used to generate the sentences were the unigram, bigram and trigram. The perplexity for each n-gr…
- Updated
Aug 11, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to thelanguagemodel topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thelanguagemodel topic, visit your repo's landing page and select "manage topics."