Movatterモバイル変換


[0]ホーム

URL:


CN111901172B - Application service monitoring method and system based on cloud computing environment - Google Patents

Application service monitoring method and system based on cloud computing environment
Download PDF

Info

Publication number
CN111901172B
CN111901172BCN202010748386.XACN202010748386ACN111901172BCN 111901172 BCN111901172 BCN 111901172BCN 202010748386 ACN202010748386 ACN 202010748386ACN 111901172 BCN111901172 BCN 111901172B
Authority
CN
China
Prior art keywords
application service
abnormal
alarm
state
monitoring client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010748386.XA
Other languages
Chinese (zh)
Other versions
CN111901172A (en
Inventor
何耀
方亚东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Cloud Information Technology Co Ltd
Original Assignee
Inspur Cloud Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Cloud Information Technology Co LtdfiledCriticalInspur Cloud Information Technology Co Ltd
Priority to CN202010748386.XApriorityCriticalpatent/CN111901172B/en
Publication of CN111901172ApublicationCriticalpatent/CN111901172A/en
Application grantedgrantedCritical
Publication of CN111901172BpublicationCriticalpatent/CN111901172B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Classifications

Landscapes

Abstract

The invention discloses an application service monitoring method and system based on cloud computing environment, belonging to the field of government cloud environment analysis, and aiming at the technical problems of centralized supervision, analysis and visual display of application service state, performance information and the like, the invention ensures the rapid positioning of application service faults in cloud environment, and adopts the following technical scheme: the method comprises the following steps: s1, detecting application service by adopting a multi-source detection mode; s2, deploying an aggregation application service monitoring client in an OpenCenter environment; s3, deploying a local application service monitoring client on a collector of the government cloud center; and S4, running the application service clients deployed on the convergence and the local in a timed task mode, and periodically monitoring the transition of the application service state. The system comprises a detection unit, a convergence deployment unit, a local deployment unit, a period monitoring unit, an alarm pushing unit, an alarm clearing unit, a visualization unit and a verification unit.

Description

Application service monitoring method and system based on cloud computing environment
Technical Field
The invention relates to the field of government cloud environment analysis, in particular to an application service monitoring method and system based on a cloud computing environment.
Background
With the rapid development and maturity of cloud computing, more and more user data are migrated to a cloud computing cloud server, and in addition, the application service scale is gradually expanded due to the cloud policy of government departments, so that great challenges are brought to the management and maintenance of application services of all government cloud centers, and unified monitoring and management application are imperative.
In the face of huge and complex network environment under cloud computing, on the premise of avoiding influencing the use of clients, timely discovery of application services when faults occur becomes particularly important, feedback is carried out only when the clients cannot normally use the network environment in the past, related personnel can conduct check processing, the obstacle clearing process is relatively complex, and the user experience of the clients and the processing efficiency of operation and maintenance personnel are greatly influenced. Therefore, how to perform centralized supervision, analysis and visual display on the state and performance information of the application service, and the like, and ensure that the application service fault in the cloud environment is rapidly located is a technical problem to be solved at present.
Disclosure of Invention
The technical task of the invention is to provide an application service monitoring method and system based on a cloud computing environment, which are used for solving the problems of centralized supervision, analysis and visual display of state and performance information of application services and the like and ensuring quick positioning of application service faults in the cloud environment.
The technical task of the invention is realized in the following way, and the application service monitoring method based on the cloud computing environment comprises the following steps:
s1, detecting application service by adopting a multi-source detection mode;
s2, deploying an aggregation application service monitoring client in an OpenCenter environment;
s3, deploying a local application service monitoring client on a collector of the government cloud center;
s4, running the application service clients deployed on the convergence and the local in a timing task mode, and periodically monitoring the transition of the application service state;
s5, the application service monitoring client side pushes the alarm to the alarm system message queue from the application service which normally changes the service state into abnormal state;
s6, the application service monitoring client pushes and clears the alarm to an alarm system message queue when the abnormal service state is converted into normal application service;
s7, displaying abnormal alarms pushed by the application service monitoring client and carried performance data in a visual mode of an alarm list by the foreground page, and accurately and quickly analyzing the current service state through the performance data of the application service;
s8, the OpenCenter convergence application service monitoring client performs abnormal data verification with the local application service monitoring client of each government cloud center.
Preferably, the detection of the application service in the step S1 by adopting a multi-source detection method is specifically as follows:
s101, detecting the internal environment and judging whether the internal environment is smooth or not:
(1) if the internal detection is not successful, locating as the state reason of the application itself;
(2) if the internal detection is clear, taking network reasons into consideration, executing step S102 next;
s102, checking network reasons: starting from the nat conversion, the nat converts the external network address into the internal network address through the router or firewall;
s103, judging whether conversion exists by acquiring a nat conversion table on the router or the firewall:
(1) if so, positioning the problem as an extranet problem;
(2) if the problem does not exist, positioning the problem to the intranet problem;
in the step S2, a converged application service monitoring client is deployed in an OpenCenter environment as a monitoring center to centrally and uniformly control government cloud center application services of all nanotubes, and the concurrent processing performance is reminded in a multithreading mode, wherein the concurrent processing is to ensure that the monitoring period is within 5 minutes, divide every 100 application services into separate sub-program detection processes, and execute all sub-programs concurrently, thereby effectively ensuring the high efficiency of the processing;
and an application service monitoring client additionally provided with an intranet monitoring mode is deployed on the acquisition machine in the step S3, and when the condition that the reference service runs normally but the program detection is abnormal due to network factors of unconfigured routing occurs on the acquisition machine, a cloud center ntp server is used as a front-end processor agent to execute a command through elastic configuration of a relevant configuration file, so that the high availability of the program is ensured.
Preferably, the timing task manner in the step S4 means that the client performs application service monitoring at a timing with granularity of 5 minutes;
the transition of the service state in the step S4 includes a transition between an abnormal state and a normal state; wherein the abnormal state includes an application service state abnormality (pages 400, 404, etc.) and an application service timeout abnormality (connection timeout, resolution timeout);
preferably, the abnormal application service in step S5 means that in the same period, if the detection is normal, the detection is repeated until 3 times, and the client records the final abnormal result to the cache; in order to prevent the influence of network fluctuation and other reasons, when 3 continuous periods are abnormal, pushing the alarm, wherein the 3 continuous periods still are abnormal and record continuous times, but the alarm is not repeatedly sent;
the application service monitoring client in step S6 compares whether the normal application service is in the abnormal service cache or not:
if yes, pushing the clearing alarm to an alarm system message queue for alarm clearing, recording the maximum continuous abnormal times and clearing the current continuous abnormal times, and when the next continuous times of the application service reach the maximum continuous abnormal times again, sending the alarm again;
if not, the method is skipped normally.
Preferably, the monitoring process of the OpenCenter converged application service monitoring client in the step S8 is specifically as follows:
deploying an application service monitoring program on an OpenCenter converged application service monitoring client;
flexibly configuring application service type information to be monitored and information of alarm pushing; the application service type information comprises a domain name and an external network ip address;
thirdly, detecting the domain name by using a curl command, returning to the state code 200 to normally pass, and judging whether to continuously monitor 3 times of abnormality when returning to abnormality, so as to prevent the cause interference of network fluctuation:
(1) if yes, executing the step (IV);
(2) if not, jumping to the step (nine);
(IV) judging whether the anomaly type is an application service state anomaly or an application service timeout anomaly:
(1) if the application service status is abnormal (i.e., there is a specific exception return code 400 or 404), then performing step (eight);
(2) if the application service timeout is abnormal (connection timeout, analysis timeout, etc.) or the state returns to the application service of 000, executing the step (fifth);
(V) detecting the domain name again by a wget command;
(VI), dynamically detecting the overtime condition by adopting an atomic suppression mode by the wget, and judging whether the overtime condition is abnormal or not:
(1) if yes, executing the step (seventh);
(2) if not, regarding the application service of which the Curl is abnormal and the wget is normal as normal processing, wherein the initial timeout time is 15s, and when the detection is carried out for 5s for 3 times continuously and every time, the service performance is considered to be problematic (the user experience is affected), and the step (nine) is skipped;
seventhly, detecting the domain name overtime abnormality again by using a curl to the external network ip address, exploring whether application service is unavailable due to the analysis problem, and simulating and logging the use command to monitor the state of the application service when detecting whether the application service is available after logging is required; judging whether to monitor 3 times of abnormality continuously:
(1) if yes, executing the step (eight);
(2) if not, jumping to the step (eight);
pushing alarm information, namely pushing and clearing alarms to an alarm system message queue when the abnormal application service information is cached each time and detected as normal next time, and zeroing the current continuous abnormal times;
and (ninth), ending.
Preferably, the monitoring process of the local application service monitoring client of the government cloud center in the step S8 is specifically as follows:
(1) An application service monitoring program is deployed at each government cloud center local application service monitoring client;
(2) Flexibly configuring application service types to be monitored, wherein the application service types comprise domain names, external network ip addresses and internal network ip addresses; when the intranet detection is affected due to the route configuration, additionally configuring a cloud center ntp server as a front-end processor calling command to detect;
(3) Detecting a domain name by using a curl command, returning the state code 200 to normally pass, and judging whether to continuously monitor 3 times of abnormality when returning to abnormality, so as to prevent the cause interference of network fluctuation:
(1) if yes, executing the step (4);
(2) if not, jumping to the step (10);
(4) Judging whether the anomaly type is an application service state anomaly or an application service timeout anomaly:
(1) if the application service status is abnormal (i.e. there is a specific exception return code 400 or 404), then step (9) is performed;
(2) if the application service timeout is abnormal (connection timeout, analysis timeout, etc.) or the state returns to the application service of 000, executing the step (5);
(5) Detecting the domain name again by a wget command;
(6) Dynamically detecting the overtime condition by adopting an atomic suppression mode, and judging whether the overtime condition is abnormal or not:
(1) if yes, executing the step (7);
(2) if not, regarding the application service of which the Curl is abnormal and the wget is normal as normal processing, wherein the initial timeout time is 15s, and when the detection is carried out for 5s for 3 times continuously and every time, the service performance is considered to be problematic (the user experience is affected), and the step (9) is skipped;
(7) Detecting the domain name overtime abnormality by using a curl again for the external network ip address, exploring whether application service is unavailable due to the analysis problem, and simulating and logging in a command to monitor the state of the application service when detecting whether the application service is available after logging in is required; judging whether to monitor 3 times of abnormality continuously:
(1) if yes, executing the step (8);
(2) if not, jumping to the step (9);
(8) Performing intranet detection by using a curl, and judging whether the intranet is abnormal three times continuously:
(1) if yes, executing the step (9);
(2) if not, executing the step (9);
(9) Pushing alarm information, namely pushing and clearing alarms to an alarm system message queue when the abnormal application service information is cached for each time and detected as normal next time, and zeroing the current continuous abnormal times;
(10) And ending.
In the application service detection process, the granularity of the monitoring time is 5 minutes, the current abnormal times (normal default is 0 and the abnormalities are accumulated) and the historical maximum abnormal times (normal default is 3 and the abnormalities are accumulated) of the cache are detected each time, and when the historical maximum abnormal times are not reached, the alarm is not pushed and only the current times are accumulated; when the current times of abnormal application services are greater than or equal to the historical maximum times, pushing alarms to an alarm system message queue, and accumulating the current and historical abnormal times at the same time.
More preferably, the abnormal data verification between the OpenCenter converged application service monitoring client in the step S8 and the local application service monitoring client of each government cloud center is specifically as follows:
the OpenCenter converged application service monitoring client and the local application service monitoring client of the government cloud center execute monitoring programs at the same granularity, and when abnormal alarms are detected, the OpenCenter converged application service monitoring client and the local application service monitoring client can communicate in real time, and whether the OpenCenter converged application service monitoring client and the local application service monitoring client are the same abnormal application service is compared:
when abnormality is detected at the same time, normally pushing abnormal application alarm information;
when one part detects the abnormality and the other part does not detect the abnormality, the operation and maintenance personnel are informed to conduct investigation, and the alarm accuracy is ensured.
An application service monitoring system based on cloud computing environment, the system comprises,
the detection unit is used for detecting the application service in a multi-source detection mode;
the convergence deployment unit is used for deploying the convergence application service monitoring client in the OpenCenter environment;
the local deployment unit is used for deploying the local application service monitoring client on the acquisition machine of the government cloud center;
the periodic monitoring unit is used for deploying the application service clients on the convergence and the local to run in a timed task mode and periodically monitoring the transition of the application service state;
the alarm pushing unit is used for pushing the alarm to the alarm system message queue from the application service monitoring client side for normally converting the service state into the abnormal service state;
the alarm clearing unit is used for pushing and clearing alarms to an alarm system message queue from the application service monitoring client side for converting the abnormal service state into normal application service;
the visualization unit is used for displaying abnormal alarms and carried performance data pushed by the application service monitoring client in a visualization mode of the alarm list on the foreground page, and accurately and quickly analyzing the current service state through the performance data of the application service;
the verification unit is used for carrying out abnormal data verification on the OpenCenter converged application service monitoring client and the local application service monitoring client of each government cloud center; the method comprises the following steps:
after the OpenCenter convergence application service monitoring client detects the abnormality, checking whether the corresponding local cloud center detects the same abnormal condition or not when pushing an alarm:
if yes, normally pushing an alarm;
if not, notifying corresponding operation and maintenance personnel to check and treat, and ensuring the accuracy of the abnormal information.
An electronic device, comprising: a memory and at least one processor;
wherein the memory stores computer-executable instructions;
the at least one processor executes the computer-executable instructions stored by the memory, causing the at least one processor to perform an application service monitoring method in a cloud-based computing environment as described above.
A computer readable storage medium having stored therein computer executable instructions that when executed by a processor perform an application service monitoring method in a cloud-based computing environment as described above.
The application service monitoring method and system based on the cloud computing environment have the following advantages:
the invention can continuously detect the application service state throughout the day through application service monitoring, find abnormality and process under the condition that the client does not sense, or can ensure that the client is informed in advance to improve the user experience, rather than waiting for the feedback of the client to know afterward and leak repair;
secondly, an application service monitoring program is deployed in a government cloud center, so that the problem that the application service cannot be found in time when the application service is abnormal in the government cloud environment is solved, and the reasons, positions and the like of the fault problem can be rapidly positioned according to the pushed alarm information and performance data;
the method and the system can automatically analyze the quality and the state of the application service in the government cloud environment, can automatically push the alarm aiming at the generation of abnormal conditions, can automatically trigger the alarm clearing mechanism after the abnormal recovery, and have high sensitivity and high accuracy.
Drawings
The invention is further described below with reference to the accompanying drawings.
FIG. 1 is a flow diagram of an application service monitoring method based on a cloud computing environment;
FIG. 2 is a flow chart of OpenCenter converged application service monitoring client monitoring;
FIG. 3 is a block flow diagram of local application service monitoring client monitoring in a government cloud center;
FIG. 4 is a schematic diagram of the nat mapping principle;
FIG. 5 is an interface screenshot of an alarm list presentation;
fig. 6 is a schematic diagram of an OpenCenter converged application service monitoring client performing abnormal data verification with a local application service monitoring client of each government cloud center.
Detailed Description
The application service monitoring method and system based on the cloud computing environment of the present invention are described in detail below with reference to the accompanying drawings and specific embodiments.
Example 1:
as shown in fig. 1, the application service monitoring method based on the cloud computing environment of the present invention specifically comprises the following steps:
s1, detecting application service by adopting a multi-source detection mode; the method comprises the following steps:
s101, detecting the internal environment and judging whether the internal environment is smooth or not:
(1) if the internal detection is not successful, locating as the state reason of the application itself;
(2) if the internal detection is clear, taking network reasons into consideration, executing step S102 next;
s102, checking network reasons: starting from the nat conversion, the nat converts the external network address into the internal network address through the router or firewall;
s103, judging whether conversion exists by acquiring a nat conversion table on the router or the firewall:
(1) if so, positioning the problem as an extranet problem;
(2) if the problem does not exist, the problem is positioned to the intranet problem.
As shown in fig. 4, the mapping principle of nat is that legal ip addresses are respectively interconnected with external network and nat mapping, and nat mapping is interconnected with internal network.
S2, deploying an aggregation application service monitoring client in an OpenCenter environment, intensively and uniformly controlling government cloud center application services of all nanotubes as a monitoring center, reminding concurrent processing performance in a multithreading mode, wherein the concurrent processing is to ensure that the monitoring period is within 5 minutes, dividing every 100 application services into independent subprograms for detection processing, and executing all subprograms concurrently, thereby effectively ensuring the high efficiency of processing;
s3, deploying a local application service monitoring client on a collector of the government cloud center; an application service monitoring client in an intranet monitoring mode is additionally arranged on the acquisition machine, and when the condition that the reference service runs normally but the program detection is abnormal due to network factors of unconfigured routing occurs on the acquisition machine, a cloud center ntp server is used as a front-end processor agent to execute a command through elastic configuration of a related configuration file, so that high availability of the program is ensured.
S4, running the application service clients deployed on the convergence and the local in a timing task mode, and periodically monitoring the transition of the application service state; the timing task mode is that the client side executes application service monitoring at regular time with granularity of 5 minutes; the transition of the service state includes a transition between the abnormal state and the normal state; wherein the abnormal state includes an application service state abnormality (pages 400, 404, etc.) and an application service timeout abnormality (connection timeout, resolution timeout);
s5, the application service monitoring client side pushes the alarm to the alarm system message queue from the application service which normally changes the service state into abnormal state; the abnormal application service means that in the same period, if the detection is normal, the abnormal application service passes, if the detection is abnormal, the detection is repeated for 3 times, and the client records the final abnormal result to the cache; in order to prevent the influence of network fluctuation and other reasons, when 3 continuous periods are abnormal, pushing the alarm, wherein the 3 continuous periods still are abnormal and record continuous times, but the alarm is not repeatedly sent;
s6, the application service monitoring client pushes and clears the alarm to an alarm system message queue when the abnormal service state is converted into normal application service; wherein the application service monitoring client will compare whether this normal application service is in the abnormal service cache:
if yes, pushing the clearing alarm to an alarm system message queue for alarm clearing, recording the maximum continuous abnormal times and clearing the current continuous abnormal times, and when the next continuous times of the application service reach the maximum continuous abnormal times again, sending the alarm again;
if not, the method is skipped normally.
S7, displaying abnormal alarms pushed by the application service monitoring client and carried performance data in a visual mode of an alarm list by the foreground page, and accurately and quickly analyzing the current service state through the performance data of the application service; as shown in fig. 5, the page shows the following specific functions:
(1) the operation and maintenance personnel can check the application service alarm and detailed information in any time period through the provided Web interface; the detailed information comprises an abnormal application service name, a domain name, an external network ip address, an abnormal application service type, a possible reason and the like;
(2) the front page can manually clear the abnormal alarm by operation and maintenance personnel.
S8, the OpenCenter convergence application service monitoring client performs abnormal data verification with the local application service monitoring client of each government cloud center.
As shown in fig. 2, the monitoring process of the OpenCenter converged application service monitoring client specifically includes:
deploying an application service monitoring program on an OpenCenter converged application service monitoring client;
flexibly configuring application service type information to be monitored and information of alarm pushing; the application service type information comprises a domain name and an external network ip address;
thirdly, detecting the domain name by using a curl command, returning to the state code 200 to normally pass, and judging whether to continuously monitor 3 times of abnormality when returning to abnormality, so as to prevent the cause interference of network fluctuation:
(1) if yes, executing the step (IV);
(2) if not, jumping to the step (nine);
(IV) judging whether the anomaly type is an application service state anomaly or an application service timeout anomaly:
(1) if the application service status is abnormal (i.e., there is a specific exception return code 400 or 404), then performing step (eight);
(2) if the application service timeout is abnormal (connection timeout, analysis timeout, etc.) or the state returns to the application service of 000, executing the step (fifth);
(V) detecting the domain name again by a wget command;
(VI), dynamically detecting the overtime condition by adopting an atomic suppression mode by the wget, and judging whether the overtime condition is abnormal or not:
(1) if yes, executing the step (seventh);
(2) if not, regarding the application service of which the Curl is abnormal and the wget is normal as normal processing, wherein the initial timeout time is 15s, and when the detection is carried out for 5s for 3 times continuously and every time, the service performance is considered to be problematic (the user experience is affected), and the step (nine) is skipped;
seventhly, detecting the domain name overtime abnormality again by using a curl to the external network ip address, exploring whether application service is unavailable due to the analysis problem, and simulating and logging the use command to monitor the state of the application service when detecting whether the application service is available after logging is required; judging whether to monitor 3 times of abnormality continuously:
(1) if yes, executing the step (eight);
(2) if not, jumping to the step (eight);
pushing alarm information, namely pushing and clearing alarms to an alarm system message queue when the abnormal application service information is cached each time and detected as normal next time, and zeroing the current continuous abnormal times;
and (ninth), ending.
As shown in fig. 3, the monitoring process of the local application service monitoring client of the government cloud center is specifically as follows:
(1) An application service monitoring program is deployed at each government cloud center local application service monitoring client;
(2) Flexibly configuring application service types to be monitored, wherein the application service types comprise domain names, external network ip addresses and internal network ip addresses; when the intranet detection is affected due to the route configuration, additionally configuring a cloud center ntp server as a front-end processor calling command to detect;
(3) Detecting a domain name by using a curl command, returning the state code 200 to normally pass, and judging whether to continuously monitor 3 times of abnormality when returning to abnormality, so as to prevent the cause interference of network fluctuation:
(1) if yes, executing the step (4);
(2) if not, jumping to the step (10);
(4) Judging whether the anomaly type is an application service state anomaly or an application service timeout anomaly:
(1) if the application service status is abnormal (i.e. there is a specific exception return code 400 or 404), then step (9) is performed;
(2) if the application service timeout is abnormal (connection timeout, analysis timeout, etc.) or the state returns to the application service of 000, executing the step (5);
(5) Detecting the domain name again by a wget command;
(6) Dynamically detecting the overtime condition by adopting an atomic suppression mode, and judging whether the overtime condition is abnormal or not:
(1) if yes, executing the step (7);
(2) if not, regarding the application service of which the Curl is abnormal and the wget is normal as normal processing, wherein the initial timeout time is 15s, and when the detection is carried out for 5s for 3 times continuously and every time, the service performance is considered to be problematic (the user experience is affected), and the step (9) is skipped;
(7) Detecting the domain name overtime abnormality by using a curl again for the external network ip address, exploring whether application service is unavailable due to the analysis problem, and simulating and logging in a command to monitor the state of the application service when detecting whether the application service is available after logging in is required; judging whether to monitor 3 times of abnormality continuously:
(1) if yes, executing the step (8);
(2) if not, jumping to the step (9);
(8) Performing intranet detection by using a curl, and judging whether the intranet is abnormal three times continuously:
(1) if yes, executing the step (9);
(2) if not, executing the step (9);
(9) Pushing alarm information, namely pushing and clearing alarms to an alarm system message queue when the abnormal application service information is cached for each time and detected as normal next time, and zeroing the current continuous abnormal times;
(10) And ending.
In the application service detection process, the granularity of the monitoring time is 5 minutes, the current abnormal times (normal default is 0 and the abnormalities are accumulated) and the historical maximum abnormal times (normal default is 3 and the abnormalities are accumulated) of the cache are detected each time, and when the historical maximum abnormal times are not reached, the alarm is not pushed and only the current times are accumulated; when the current times of abnormal application services are greater than or equal to the historical maximum times, pushing alarms to an alarm system message queue, and accumulating the current and historical abnormal times at the same time.
As shown in fig. 6, the abnormal data verification between the OpenCenter converged application service monitoring client and the local application service monitoring client of each government cloud center is specifically as follows:
the OpenCenter converged application service monitoring client and the local application service monitoring client of the government cloud center execute monitoring programs at the same granularity, and when abnormal alarms are detected, the OpenCenter converged application service monitoring client and the local application service monitoring client can communicate in real time, and whether the OpenCenter converged application service monitoring client and the local application service monitoring client are the same abnormal application service is compared:
when abnormality is detected at the same time, normally pushing abnormal application alarm information;
when one part detects the abnormality and the other part does not detect the abnormality, the operation and maintenance personnel are informed to conduct investigation, and the alarm accuracy is ensured.
Example 2:
the invention relates to an application service monitoring system based on cloud computing environment, which comprises,
the detection unit is used for detecting the application service in a multi-source detection mode;
the convergence deployment unit is used for deploying the convergence application service monitoring client in the OpenCenter environment;
the local deployment unit is used for deploying the local application service monitoring client on the acquisition machine of the government cloud center;
the periodic monitoring unit is used for deploying the application service clients on the convergence and the local to run in a timed task mode and periodically monitoring the transition of the application service state;
the alarm pushing unit is used for pushing the alarm to the alarm system message queue from the application service monitoring client side for normally converting the service state into the abnormal service state;
the alarm clearing unit is used for pushing and clearing alarms to an alarm system message queue from the application service monitoring client side for converting the abnormal service state into normal application service;
the visualization unit is used for displaying abnormal alarms and carried performance data pushed by the application service monitoring client in a visualization mode of the alarm list on the foreground page, and accurately and quickly analyzing the current service state through the performance data of the application service;
the verification unit is used for carrying out abnormal data verification on the OpenCenter converged application service monitoring client and the local application service monitoring client of each government cloud center; the method comprises the following steps:
after the OpenCenter convergence application service monitoring client detects the abnormality, checking whether the corresponding local cloud center detects the same abnormal condition or not when pushing an alarm:
if yes, normally pushing an alarm;
if not, notifying corresponding operation and maintenance personnel to check and treat, and ensuring the accuracy of the abnormal information.
Example 3:
the embodiment of the invention also provides electronic equipment, which comprises: a memory and a processor;
wherein the memory stores computer-executable instructions;
one processor executes the computer-executable instructions stored by the memory, causing the one processor to perform the application service monitoring method in a cloud-based computing environment as in embodiment 1.
Example 6:
the embodiment of the invention also provides a computer readable storage medium, wherein a plurality of instructions are stored, and the instructions are loaded by a processor, so that the processor executes the application service monitoring method based on the cloud computing environment in any embodiment of the invention. Specifically, a system or apparatus provided with a storage medium on which a software program code realizing the functions of any of the above embodiments is stored, and a computer (or CPU or MPU) of the system or apparatus may be caused to read out and execute the program code stored in the storage medium.
In this case, the program code itself read from the storage medium may realize the functions of any of the above-described embodiments, and thus the program code and the storage medium storing the program code form part of the present invention.
Examples of storage media for providing program code include floppy disks, hard disks, magneto-optical disks, optical disks (e.g., CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-R based application service monitoring methods and systems M, DVD-RW, DVD+RW) in a cloud computing environment, magnetic tapes, nonvolatile memory cards, and ROMs. Alternatively, the program code may be downloaded from a server computer by a communication network.
Further, it should be apparent that the functions of any of the above-described embodiments may be implemented not only by executing the program code read out by the computer, but also by causing an operating system or the like operating on the computer to perform part or all of the actual operations based on the instructions of the program code.
Further, it is understood that the program code read out by the storage medium is written into a memory provided in an expansion board inserted into a computer or into a memory provided in an expansion unit connected to the computer, and then a CPU or the like mounted on the expansion board or the expansion unit is caused to perform part and all of actual operations based on instructions of the program code, thereby realizing the functions of any of the above embodiments.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present invention, and not for limiting the same; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some or all of the technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit of the invention.

Claims (9)

CN202010748386.XA2020-07-302020-07-30Application service monitoring method and system based on cloud computing environmentActiveCN111901172B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202010748386.XACN111901172B (en)2020-07-302020-07-30Application service monitoring method and system based on cloud computing environment

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202010748386.XACN111901172B (en)2020-07-302020-07-30Application service monitoring method and system based on cloud computing environment

Publications (2)

Publication NumberPublication Date
CN111901172A CN111901172A (en)2020-11-06
CN111901172Btrue CN111901172B (en)2023-04-21

Family

ID=73183426

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202010748386.XAActiveCN111901172B (en)2020-07-302020-07-30Application service monitoring method and system based on cloud computing environment

Country Status (1)

CountryLink
CN (1)CN111901172B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113485864B (en)*2021-07-262024-05-14北京达佳互联信息技术有限公司Abnormality detection, abnormality analysis method, abnormality detection device, abnormality analysis device, electronic apparatus, and storage medium
US20240054231A1 (en)*2022-08-152024-02-15Microsoft Technology Licensing, LlcCloud-agnostic code analysis
CN115442214B (en)*2022-09-022024-12-31中国工商银行股份有限公司 Business abnormality troubleshooting method, device, equipment, storage medium and program product
CN115664990A (en)*2022-10-092023-01-31浪潮云信息技术股份公司Monitoring data aggregation and processing method in government affair cloud scene
CN116383208B (en)*2023-03-302025-09-26浪潮云信息技术股份公司 A method, device, equipment and storage medium for data aggregation at multiple time granularities

Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7577701B1 (en)*2001-01-222009-08-18Insightete CorporationSystem and method for continuous monitoring and measurement of performance of computers on network
CN102523115A (en)*2011-12-162012-06-27广东高新兴通信股份有限公司Server monitoring system based on power environment system
CN109144835A (en)*2018-08-022019-01-04广东浪潮大数据研究有限公司A kind of automatic prediction method, device, equipment and the medium of application service failure
CN110149421A (en)*2019-05-302019-08-20世纪龙信息网络有限责任公司Method for monitoring abnormality, system, device and the computer equipment of domain name system
CN111193609A (en)*2019-11-202020-05-22腾讯科技(深圳)有限公司Application abnormity feedback method and device and application abnormity monitoring system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7577701B1 (en)*2001-01-222009-08-18Insightete CorporationSystem and method for continuous monitoring and measurement of performance of computers on network
CN102523115A (en)*2011-12-162012-06-27广东高新兴通信股份有限公司Server monitoring system based on power environment system
CN109144835A (en)*2018-08-022019-01-04广东浪潮大数据研究有限公司A kind of automatic prediction method, device, equipment and the medium of application service failure
CN110149421A (en)*2019-05-302019-08-20世纪龙信息网络有限责任公司Method for monitoring abnormality, system, device and the computer equipment of domain name system
CN111193609A (en)*2019-11-202020-05-22腾讯科技(深圳)有限公司Application abnormity feedback method and device and application abnormity monitoring system

Also Published As

Publication numberPublication date
CN111901172A (en)2020-11-06

Similar Documents

PublicationPublication DateTitle
CN111901172B (en)Application service monitoring method and system based on cloud computing environment
US11442803B2 (en)Detecting and analyzing performance anomalies of client-server based applications
US10346744B2 (en)System and method for visualisation of behaviour within computer infrastructure
US8504679B2 (en)Methods, systems and computer program products for managing execution of information technology (IT) processes
US8972783B2 (en)Systems and methods for fast detection and diagnosis of system outages
CN110309130A (en)A kind of method and device for host performance monitor
US20200409822A1 (en)Intelligent services and training agent for application dependency discovery, reporting, and management tool
US8326971B2 (en)Method for using dynamically scheduled synthetic transactions to monitor performance and availability of E-business systems
US10931533B2 (en)System for network incident management
CN113946499A (en)Micro-service link tracking and performance analysis method, system, equipment and application
CN106649040A (en)Automatic monitoring method and device for performance of Weblogic middleware
CN111163150A (en)Distributed calling tracking system
CN117422434A (en)Wisdom fortune dimension dispatch platform
US10528456B2 (en)Determining idle testing periods
CN114356499A (en) Root cause analysis method and device for Kubernetes cluster alarm
CA3144664A1 (en)Determining problem dependencies in application dependency discovery, reporting, and management tool
US20250077631A1 (en)Method and system for investigating resiliency of a software application
CN107635003A (en) System log management method, device and system
CN109240863A (en)A kind of cpu fault localization method, device, equipment and storage medium
CN112445677B (en)Health state checking and repairing method and storage medium for computer cluster
CN116204386B (en)Method, system, medium and equipment for automatically identifying and monitoring application service relationship
CN115102838B (en)Emergency processing method and device for server downtime risk and electronic equipment
CN113900898B (en)Data processing system, equipment and medium
CN111143154A (en)Wharf operating system operation monitoring method and device, server and storage medium
CN113626288A (en)Fault processing method, system, device, storage medium and electronic equipment

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp