Disclosure of Invention
The invention provides a method for constructing an industrial chain view of a listed company, which aims to solve the problems and comprises the following steps: s1, determining the main business of the listed companies according to the financial reports of the listed companies, and clustering the listed companies with the main business in the same industry according to the classification of national industry standards;
s2, acquiring an upstream company and a downstream company of the main business of the listed company;
and S3, connecting the listed companies with the corresponding upstream companies and downstream companies by line segments by taking the company hierarchy as a node to form a view with the clustering property.
In step S1, a business having a revenue ratio of more than one third of the public company financial report is used as a main business.
Further, in step S2, the method for acquiring the upstream company and the downstream company of the main business of the listed company includes determining the upstream company and the downstream company according to the objects of the expense and revenue in the financial report of the listed company.
Further, the upstream company and the downstream company include a listed company and a non-listed company.
Further, the construction method further comprises optimizing the view by a force-directed algorithm.
The invention also provides a system for visualizing the industrial chain view of the listed companies, which comprises an original data module, a data processing module and a visualization module, wherein the original data module acquires the financial report information of the listed companies, acquires the names, the business operations, the revenue objects and amounts, and the expense objects and amounts in a keyword extraction mode, the data processing module determines downstream companies according to the revenue objects, determines upstream companies according to the expense objects, determines the main operation services of the listed companies according to the revenue ratios of the revenue objects, clusters the listed companies in the same main operation services, and the visualization module takes the company level as a node, connects the listed companies with the upstream companies and the downstream companies by taking a line segment as a connection, and visually outputs the industrial chain view.
Further, the data processing module determines the weight of a downstream company according to the revenue duty ratio of the revenue object, determines the weight of an upstream company according to the expenditure duty ratio of the expenditure object, and visually outputs the weight through the visualization module.
Further, the system also comprises an interaction module, and the interaction module is used for a user to select the output content of the visualization module.
Further, the system also comprises a data updating module, and the data updating module is used for updating the data in the original data module.
Furthermore, the system also comprises a classification adjusting module, and the classification adjusting module is used for adjusting the classification of the national standard industry according to the information collected in the financial reports.
The invention has the following beneficial effects: the method for constructing the view of the industrial chain of the listed company is provided, the view is constructed from two dimensions of an industry level and an industrial chain level, the listed company is clustered by taking a main business as a commonality, an investor is facilitated to judge the market environment of the listed company in the current industry, the upstream and downstream companies are taken as contact points, the position of the listed company on the whole industrial chain is reflected, and the accurate evaluation of the macroscopic environment of the listed company is facilitated.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. Unless otherwise specified, the technical means used in the examples are conventional means well known to those skilled in the art.
In the description of the present invention, it is to be understood that the terms "longitudinal", "lateral", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on those shown in the drawings, are merely for convenience of description of the present invention, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed and operated in a particular orientation, and thus, are not to be construed as limiting the present invention.
As shown in fig. 1, a method for constructing an industry chain view of a listed company includes:
s1, determining the main business of the listed companies according to the financial reports of the listed companies, and clustering the listed companies with the main business in the same industry according to the classification of national industry standards;
s2, acquiring an upstream company and a downstream company of the main business of the listed company;
and S3, connecting the listed companies with the corresponding upstream companies and the downstream companies by line segments by taking the company hierarchy as a node to form a view with the clustering property.
As shown in fig. 2, the different shapes of the graphics represent the listed companies in the same industry, and are clustered for different industries.
The operation condition of the listed company financial newspaper is accurately and comprehensively disclosed, the data extracted from the financial newspaper has representativeness and accuracy, in step S1, the business accounting for more than one third of the listed company financial newspaper is used as the main business, after the main business is determined, the listed companies with the main business in the same industry are clustered, the clustering takes the industry as the commonality, which represents the condition of the listed companies in the industry, namely the market environment, it can be understood that investors need to know the investment object and necessarily collect a large amount of information related to the investment object, the investment decision is further influenced by the information to evaluate the investment object, the large amount of information related to the investment object does not have good guidance, generally speaking, the information is scattered, the listed company clustering set with the industry as the commonality can provide good guidance for investors, so that the data can be directional at the beginning of collecting data, and a certain workload can be reduced; on the basis of national industry standard classification, on one hand, the matching relation of keywords is easy to obtain when data are collected, on the other hand, consensus is easy to form in the industry, for the national industry standard classification, a large class and a small class exist, so that the classification has a hierarchy, for example, agriculture is taken as a large class, the small classes such as planting, animal husbandry and aquaculture are arranged below the large class, a certain marketing company carries out major animal husbandry, the clustered marketing company is divided into two hierarchies, the first hierarchy comprises the marketing company of the animal husbandry, and the second hierarchy comprises all marketing companies belonging to the large class of the agricultural industry, such as the animal husbandry and the planting industry; in the industry chain dimension, in step S2, the method for obtaining the upstream company and the downstream company of the main business of the listed company includes determining the upstream company and the downstream company according to the objects of expense and revenue in the report of the listed company, obtaining the upstream company and the downstream company of the listed company, in the case that the same-industry cluster is formed, the upstream company and the downstream company of the listed company also have a cluster set, the upstream company and the downstream company obtained in the report data are the industry chain where the actual connection has occurred, and the other companies in the upstream company and the downstream company set are the industry chains where the connection is possible, and the environment where the enterprise is currently located in the industry chain is determined, for example, a certain upstream company has two upstream companies a and B, where the capacity of company a has a problem, and the operation of the listed company is influenced to a certain extent, when the investor is evaluating, the internal conditions in the current upstream company set can be intuitively known through the cluster set of the upstream companies, if the capacity is insufficient due to the individual reason of the company A, the operation of the upstream company can be ensured by replacing the upstream company, and if the problem exists in the whole industry of the current upstream company, the operation prospect of the listed company is not ideal; from the above, the industrial chain view with the industry clustering property has good guiding functions of investment guiding, analysis and the like;
the method is characterized in that company hierarchies are taken as nodes, listed companies and upstream and downstream companies thereof are connected through line segments, the connection between the companies is mainly displayed in a view, the line segments with direction indications can be used for connection to indicate specific upstream and downstream relations, the judgment of the relation between a large number of point lines by a user is facilitated when the whole industry view is displayed, for the view, the data of the corresponding company can be hung on the background of the view, the data can be any observable information and even comprise news information related to the company, the information can be used as content selectively viewed by the user, of course, the information can be correspondingly collected in a third party access mode, and the information can also be hung by an industry chain view builder.
For the same listed company, a plurality of main businesses may exist, and then when the company hierarchy is taken as a node to show the industry chain view, the industry cluster sets may be crossed, for a single industry, the conditions of the listed company in the industry and on the industry chain can be visually seen, when the crossed condition exists, the main business industry related to the listed company and the related industry chain can be integrated by taking the current listed company as a reference point to form a combined view, and the view can be used for a user to select independent display, so that the method is beneficial to clearing the industry chain condition of the main businesses related to a wide range of listed companies.
In fact, the industry cluster data and the upstream and downstream industry chain data are combined, and then the industry chain data with the industry cluster property is obtained.
For the concrete embodiment of the industry chain view, a network-like view based on network can be adopted, for example, the listed companies are used as network nodes in the graph, the connection between the listed companies and the upstream and downstream companies is used as a connecting line between the nodes, the industry classification of the listed companies can be used as a color distinguishing basis of the nodes, mapping is carried out in the view through different colors, the nodes of the listed companies under different industry classifications can be set to be different in shape, size and the like for better distinguishing the nodes, a prominent display mode can be adopted, for example, a user examines the C listed company, then the nodes and the connecting line related to the C listed company in the view are prominently displayed, and the modes of independently displaying related information, amplifying display, integral color area and the like can be adopted; for the industry chain view of the whole industry, the number of nodes is numerous, the connection relation is complex, the overall layout of the view is important, the view can be optimized through a force guiding algorithm, the industry chain relation of the whole industry and the industry chain relation of a single industry are respectively visualized through a force-guided dotted line connection diagram, the direct upstream and downstream relation and the industry clustering of the listed company are embodied by dotted line connection, for example, the overall effect of the view is controlled by controlling a node upper limit parameter and a connection weight parameter, the node upper limit is taken as a main control parameter, the connection weight parameter is understood as that the connection line is the relation between the listed company and the upstream and downstream companies, the weight can be directly understood as the operation and expenditure occupation ratio between the listed company and the upstream and downstream companies, when the occupation ratio is less than a certain value, the relation can be ignored, that is the upstream and downstream companies are not embodied on the view, therefore, the view is reasonably simplified, and the overall layout of the view is facilitated.
Preferably, the upstream companies and the downstream companies include a listed company and a non-listed company, as shown in fig. 3, a square represents a plurality of listed companies in a certain industry, an ellipse, a trapezoid and a fan respectively represent the upstream companies of the listed companies in the industry, wherein the ellipse and the trapezoid represent the listed companies, the fan represents the non-listed companies, the regular pentagon and the crescent respectively represent the downstream companies of the listed companies in the industry, and are all listed companies, of course, the downstream companies of the listed companies can also have non-listed companies, all related industries of the listed companies and the upstream and downstream industry chains of the industries can form a complete and continuous view for the whole industry chain view, and when analyzing a certain listed company, the importance of the non-listed company on the industry chain can not be ignored, whether the upstream company or the downstream company, as long as the operating or expenditure ratio exceeds a certain degree, the importance of the operating or expenditure ratio to the listed company can be explained, and in the industry chain view provided by the invention, the positions of the non-listed companies on the industry chain are shown in an endpoint form, and of course, the situation that nodes of the same non-listed company are used as different upstream and downstream endpoints exists, and actually, the information has important guiding significance on the industry chain view.
In the drawings, the figures with different shapes represent companies of different industries, including listed companies and non-listed companies, and different shapes, different colors or differential views formed by combining the different shapes and the different colors can be adopted to express the conditions of the different industries, the listed companies and the like in the specific implementation process.
The invention also provides a system for visualizing the industrial chain view of the listed companies, which comprises an original data module, a data processing module and a visualization module, wherein the original data module acquires the financial report information of the listed companies, acquires the names, the business operations, the revenue objects and amounts, and the expense objects and amounts in a keyword extraction mode, the data processing module determines downstream companies according to the revenue objects, determines upstream companies according to the expense objects, determines the main operation of the listed companies according to the revenue proportion of the revenue objects, clusters the listed companies in the same main operation, and the visualization module takes the company level as a node, connects the listed companies with the upstream companies and the downstream companies by taking a line segment as a link, and visually outputs the industrial chain view.
The modules can adopt a split or integrated design, only necessary data exchange needs to be completed, wherein the original data module mainly has the functions of data acquisition and key data extraction, information in the financial reports of listed companies exists in the original data module in a complete storage mode, the extracted key information is used as original data of the data processing module, the condition that natural semantic identification is inaccurate when keywords are extracted is considered, the information in the complete stored financial reports can be manually identified in a natural semantic way under the condition that the keywords are extracted wrongly, as an error correction means, a manually accessed input interface can be added, and the calibration is carried out when necessary, for example, in the soda industry, the soda industry contains soda ash and baking soda, the soda and the soda are respectively sodium carbonate and sodium bicarbonate, the keyword extraction at the moment can not accurately classify enterprises for preparing the sodium bicarbonate into the soda industry, manual identification is necessary at this time.
Preferably, the data processing module determines the weight of the downstream company according to the operating proportion of the operating object, determines the weight of the upstream company according to the expenditure proportion of the expenditure object, and visually outputs the weight through the visualization module, wherein the visual output of the weight can be embodied on a connecting line between the upstream company and the downstream company, and the weight of different upstream and downstream companies on the upstream company is embodied through the thickness of the connecting line.
As a preferred scheme, the system further comprises an interaction module, the interaction module is used for a user to select output contents of the visualization module, after the user determines an investment object, more people want to know the industry and the industrial environment where the object is located, then targeted display is very important, and the selection mode can adopt click selection, background hiding, highlight and other modes commonly used in the prior art.
Preferably, the system further comprises a data updating module, the data updating module is configured to update the data in the original data module, for a listed company, there may be a change in its main business, for example, a mineral enterprise changes its new energy battery, because the mineral enterprise has rich raw materials and has the development capability of the new energy battery, the main business is correspondingly increased or even changed, if the mineral enterprise originally has a greater influence on the industry, the change in its main business will have a greater impact on the related industry and the industry chain, and the updating module collects the relevant listed company information in real time to update the data in the original data module, so as to ensure the accuracy of the industry chain view.
As a preferable scheme, the system further includes a classification adjustment module, where the classification adjustment module is configured to adjust the national standard industry classification according to information collected in the financial reports, and for the classification adjustment, the classification adjustment is mainly based on a specific operation mode of some enterprises in a current market large environment, for example, breeding and slaughtering in animal husbandry are different classifications, and slaughtering in industry is a downstream enterprise of breeding, but in regions such as inner cover and tibetan where breeding is mainly concentrated, enterprises are bred and slaughtered at the same time, and in this case, industry clustering needs to be performed according to a specific operation condition of the company.
The above embodiments are merely illustrative of the preferred embodiments of the present invention, and do not limit the scope of the present invention, and those skilled in the art should make various changes, modifications, alterations, and substitutions on the technical solution of the present invention without departing from the spirit of the present invention, which falls within the protection scope defined by the claims of the present invention.