Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

This project is aimed to create an automated method that is able to identify emerging risks faced by multiple businesses and industries, and the trends of those risks.

NotificationsYou must be signed in to change notification settings

Dhruv26/10K-emerging-risk-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Company annual financial reports are a very useful source of information about the company'sperformance,opportunities andrisks in the industries. Considering the amount of data contained in every report and large number of reports from different companies available, there is a strong motivation to automate processing of this information, therefore gaining more insights into the current state of industries.

This project is aimed to create an automated method that is able to identify emerging risks faced by multiple businesses and industries, and the trends of those risks. The research focuses ontopic modelling,keyword extraction,sentiment analysis andclustering inannual financial reports.
It compares the performance of the following methods when it comes to identifying emerging risks and clustering them:-

  • topic modelling
  • keyword extraction, supplemented with sentiment analysis and clustering

Extracting the most useful information from the textual financial
report data and correctly interpreting it is an ongoing challenge and
hopefully methods used in this project will prove to be useful in futureresearch.

This repo contains a framework that:-

  • Analyze the dataset and 10-K reports
  • Extract theRisk Factors section from 10-K reports
  • Develop a method to identify risks disclosed in theRisk Factors section
  • Classify a risk as an emerging one
  • Analyze and evaluate the results

The framework contains two main packages:-

  • risk_detection.preprocessing: This contains code to process 10-K reports and extract theRisk Factors section from10-K Reports.
  • risk_detection.analysis: This contains code to analyze theRisk Factors sections. It is responsible for trainingTopic Models,Keyword Extraction,Sentiment Analysis andClustering. It also contains an algorithm which compares clusters across years and detect new clusters. This package also contains code to group 10-K reports by industries and train any models required separately.

Most of the analysis has been done in Jupyter Notebooks and are available in thenotebooks/final_submission folder.

About

This project is aimed to create an automated method that is able to identify emerging risks faced by multiple businesses and industries, and the trends of those risks.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors2

  •  
  •  

[8]ページ先頭

©2009-2025 Movatter.jp