A comprehensive desktop application designed for intelligent text analysis and linguistic processing. Built with Python and tkinter, it provides advanced natural language processing capabilities including word frequency analysis, named entity recognition, pattern extraction, and multi-language support with automatic language detection.


TALOS-AI4SSH/Text-File-Analyser


Python | Tkinter | spaCy | Stanza

Available languages: English | Ελληνικά


This repository provides a lightweight desktop GUI application for analyzing, extracting, and exporting linguistic information from plain-text files.


📂 Contents

This repository contains:

  • Talos_Text_Analyser.py — the complete source code (all-in-one Tkinter application)
  • Talos_Text_Analyser_Documentation.pdf — installation & usage guide (Version 2.0)

Overview

The TALOS Text File Analyser (TFA) is a cross-platform desktop tool (Tkinter) for interactive text analysis. It can:

  • Perform word frequency analysis
  • Extract nouns (POS-based)
  • Detect person and location names (NER)
  • Run lemmatization
  • Extract lexico-syntactic patterns (predefined and custom, up to 5 tokens)
  • Export results to Excel (.xlsx) and CSV

Designed for researchers, educators, and developers, with special support for Ancient Greek via Stanza and modern languages via spaCy.
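
As an illustration of the first capability, word frequency analysis can be sketched with the Python standard library alone. This is a minimal sketch and not the application's own code: the function name word_frequencies and the regex-based tokenization are assumptions made for the example, whereas the application relies on spaCy and Stanza for its linguistic processing.

```python
import re
from collections import Counter


def word_frequencies(text, top_n=10):
    """Return the top_n most frequent word tokens as (word, count) pairs.

    Hypothetical helper for illustration only; it is not taken from
    Talos_Text_Analyser.py.
    """
    # Lowercase the text, then treat every run of word characters as a token.
    tokens = re.findall(r"\w+", text.lower())
    # Counter tallies the tokens; most_common sorts them by descending count.
    return Counter(tokens).most_common(top_n)


if __name__ == "__main__":
    sample = "The cat saw the dog. The dog ran."
    for word, count in word_frequencies(sample, top_n=3):
        print(f"{word}\t{count}")
```

A regex tokenizer keeps the sketch dependency-free, but it does no lemmatization or POS filtering, which is where an NLP pipeline becomes necessary.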


Features

  • Automatic language detection with tuned English/Greek heuristics; distinguishes Ancient from Modern Greek.
  • Six analysis modes: Words, Nouns, Person names, Location names, Lemmas, Pattern extraction.
  • Custom pattern builder (POS templates, wildcard, up to 5 positions).
  • Export to Excel (multi-sheet) and CSV.
  • Multilingual NLP: spaCy models for modern languages; Stanza for Ancient Greek (grc).
  • Modern dark UI with progress bars and responsive feedback.
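
The custom pattern builder described above (POS templates, a wildcard, at most 5 positions) can be pictured as a sliding-window match over POS-tagged tokens. The following is a minimal sketch under stated assumptions, not the application's implementation: the names find_pattern, WILDCARD, and MAX_POSITIONS are hypothetical, the "*" wildcard symbol is chosen for the example, and the input is assumed to be a list of (token, POS) pairs with Universal POS tags such as those produced by spaCy or Stanza.

```python
WILDCARD = "*"      # assumed wildcard symbol: matches any POS tag
MAX_POSITIONS = 5   # the README states custom patterns span up to 5 positions


def find_pattern(tagged, template):
    """Return the token sequences whose POS tags match the template.

    tagged:   list of (token, pos) pairs, e.g. [("old", "ADJ"), ("city", "NOUN")]
    template: list of POS tags, where WILDCARD matches any tag
    """
    if not 1 <= len(template) <= MAX_POSITIONS:
        raise ValueError(f"template must have 1 to {MAX_POSITIONS} positions")
    size = len(template)
    matches = []
    # Slide a window of the template's length across the tagged tokens.
    for start in range(len(tagged) - size + 1):
        window = tagged[start:start + size]
        if all(want == WILDCARD or want == pos
               for want, (_, pos) in zip(template, window)):
            matches.append(" ".join(token for token, _ in window))
    return matches


if __name__ == "__main__":
    tagged = [("the", "DET"), ("old", "ADJ"), ("city", "NOUN"),
              ("of", "ADP"), ("Crete", "PROPN")]
    print(find_pattern(tagged, ["ADJ", "NOUN"]))
    print(find_pattern(tagged, ["NOUN", WILDCARD, "PROPN"]))
```

Working on pre-tagged pairs keeps the matching logic independent of the tagger, so the same function would apply to output from either NLP library.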

Run

# 1) Ensure Python 3.8+ is installed
python --version

# 2) Install core libraries
pip install pandas openpyxl spacy langdetect stanza

# 3) Install spaCy language models (essential)
python -m spacy download en_core_web_sm
python -m spacy download el_core_news_sm   # Greek

# 4) Install Stanza Ancient Greek model (one-time)
python - << 'PY'
import stanza
stanza.download("grc")
PY

# 5) Launch the GUI application
python Talos_Text_Analyser.py
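
Before launching the GUI, it can help to confirm that the libraries from step 2 are importable. The script below is a hedged sketch rather than part of the repository: missing_packages is a hypothetical helper, and it only checks that each package can be found on the import path, not that the spaCy or Stanza language models have been downloaded.

```python
import importlib.util

# Import names of the libraries installed in step 2 of the run instructions.
REQUIRED = ["pandas", "openpyxl", "spacy", "langdetect", "stanza"]


def missing_packages(names):
    """Return the names that cannot be found on the current import path."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All core libraries are installed.")
```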

Author

Prof. Christophe Roche — TALOS ERA Chair Holder — University of Crete

📧 roche.university@gmail.com

🌐 https://talos-ai4ssh.uoc.gr/


Citation

For general reference to the project:

Roche, C. (2025). TALOS Text File Analyser (Version 2.0).
TALOS AI4SSH Project, University of Crete.
https://talos-ai4ssh.uoc.gr/


License

All the code is distributed under the Creative Commons Attribution–NonCommercial (CC BY-NC 4.0) license.
You are free to share and redistribute the material under the following conditions:

  • BY: Credit must be given to the creator(s).
  • NC: Only non-commercial uses are permitted.

More info: https://creativecommons.org/licenses/by-nc/4.0/



