Balachandran, 2023 - Google Patents

Tamil-Llama: A new Tamil language model based on Llama 2

Document ID: 14499910178143428093
Author: Balachandran A
Publication year: 2023
Publication venue: arXiv preprint arXiv:2311.05845

Snippet

Language modeling has witnessed remarkable advancements in recent years, with Large Language Models (LLMs) like ChatGPT setting unparalleled benchmarks in human-like text generation. However, a prevailing limitation is the underrepresentation of languages like …
Continue reading at arxiv.org (PDF)

Classifications

The classifications are assigned by a computer and are not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the classifications listed.
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/28 Processing or translating of natural language
    • G06F17/2809 Data driven translation
    • G06F17/2827 Example based machine translation; Alignment
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/27 Automatic analysis, e.g. parsing
    • G06F17/2765 Recognition
    • G06F17/277 Lexical analysis, e.g. tokenisation, collocates
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634 Querying
    • G06F17/30657 Query processing
    • G06F17/30675 Query execution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/28 Processing or translating of natural language
    • G06F17/2872 Rule based translation
    • G06F17/2881 Natural language generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/21 Text processing
    • G06F17/22 Manipulating or registering by use of codes, e.g. in sequence of text characters
    • G06F17/2217 Character encodings
    • G06F17/2223 Handling non-latin characters, e.g. kana-to-kanji conversion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/27 Automatic analysis, e.g. parsing
    • G06F17/2705 Parsing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/28 Processing or translating of natural language
    • G06F17/289 Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • G06F17/28 Processing or translating of natural language
    • G06F17/2863 Processing of non-latin text

Similar Documents

Publication | Publication Date | Title
Balachandran | | Tamil-Llama: A new Tamil language model based on Llama 2
US10769387B2 (en) | | System and method for translating chat messages
US20140163952A1 (en) | | Conversion of input text strings
Ahmadnia et al. | | Augmenting neural machine translation through round-trip training approach
US20220383159A1 (en) | | Systems and methods for open domain multi-hop question answering
CN116611448A (en) | | Method and device for generating emotional text based on hint learning and masked language model
Li et al. | | Improving text normalization using character-blocks based models and system combination
Ballesteros et al. | | Greedy transition-based dependency parsing with stack LSTMs
Muniraj et al. | | HNTSumm: Hybrid text summarization of transliterated news articles
Ming et al. | | Marco-LLM: Bridging languages via massive multilingual training for cross-lingual enhancement
Chen et al. | | Sentence-level heuristic tree search for long text generation
Meyer et al. | | Subword segmental machine translation: Unifying segmentation and target sentence generation
Palomino et al. | | Palomino-Ochoa at TASS 2020: Transformer-based Data Augmentation for Overcoming Few-Shot Learning
Barteld et al. | | Dealing with word-internal modification and spelling variation in data-driven lemmatization
Mammadov et al. | | Part-of-speech tagging for Azerbaijani language
Lafkiar et al. | | An Arabic question generation system based on a shared BERT-base encoder-decoder architecture
Elakkiya et al. | | Fine tuning pretrained transformers for abstractive news summarization
Le et al. | | Towards a low-resource neural machine translation for indigenous languages in Canada
Ephrem | | Development of Bidirectional Amharic-Tigrinya Machine Translation Using Recurrent Neural Networks
Ogawa et al. | | Japanese Particle Error Correction employing Classification Model
Chouhan et al. | | HindiLLM: Large Language Model for Hindi
Billah et al. | | Towards Santali linguistic inclusion: building the first Santali-to-English translation model using mT5 transformer and data augmentation
Linn et al. | | Statistical machine translation between Myanmar (Burmese) and Kayah
Noever et al. | | Local translation services for neglected languages
Raihan et al. | | Mixed-Distil-BERT: Code-mixed language modeling for Bangla, English, and Hindi
