


技术领域technical field
本发明涉及计算机应用技术领域,尤其是涉及一种链路监控方法、告警方法、系统、终端及存储介质。The present invention relates to the technical field of computer applications, in particular to a link monitoring method, an alarm method, a system, a terminal and a storage medium.
背景技术Background technique
随着微服务架构的流行,服务根据不同的维度进行细化,因此一次请求往往需要涉及到多个服务。互联网应用构建在不同的应用模块集上,这些应用模块可能使用不同的语言进行编程,分布在不同的服务器和数据中心,相关的链路监控技术难以实现跨语言和平台的链路监控。因此,就需要一些可以帮助理解系统行为、用于分析性能问题的工具,以便发生故障的时候,能够快速定位和解决问题。应用全链路监控能够实现对复杂架构的数据分析和监控。With the popularity of microservice architecture, services are refined according to different dimensions, so a request often needs to involve multiple services. Internet applications are built on different sets of application modules. These application modules may be programmed in different languages and distributed in different servers and data centers. It is difficult for related link monitoring technologies to achieve cross-language and platform link monitoring. Therefore, there is a need for tools that can help understand system behavior and analyze performance problems so that when failures occur, problems can be quickly located and resolved. The application of full-link monitoring enables data analysis and monitoring of complex architectures.
目前,全链路监控根据通用逻辑进行监控,缺乏定制化拓展,无法满足不同的监控需求场景,需要引进额外的监控系统,增加了使用成本。At present, full-link monitoring is based on general logic, lacks customized expansion, cannot meet different monitoring demand scenarios, and requires the introduction of additional monitoring systems, which increases the cost of use.
发明内容SUMMARY OF THE INVENTION
本发明旨在至少解决现有技术中存在的技术问题之一。为此,本发明提出一种链路监控方法、告警方法、系统、终端及存储介质,能够实现监控逻辑的拓展,满足监控多变的需求场景,无需额外引进监控系统,降低了使用成本。The present invention aims to solve at least one of the technical problems existing in the prior art. To this end, the present invention proposes a link monitoring method, an alarm method, a system, a terminal and a storage medium, which can realize the expansion of monitoring logic, meet the changing demand scenarios of monitoring, do not need to introduce additional monitoring systems, and reduce the use cost.
根据本发明的第一方面实施例的链路监控方法,用于网络监控系统,包括:获取第一监控探针的标识ID;同步第二监控探针,将所述第二监控探针的标识ID设置为所述第一监控探针的标识ID;获取第一监控指标和第二监控指标,所述第二监控指标封装有SPAN标签;上报所述第一监控指标和所述第二监控指标。The link monitoring method according to the embodiment of the first aspect of the present invention, used in a network monitoring system, includes: acquiring an identification ID of a first monitoring probe; synchronizing a second monitoring probe, and converting the identification of the second monitoring probe The ID is set to the identification ID of the first monitoring probe; the first monitoring indicator and the second monitoring indicator are obtained, and the second monitoring indicator is encapsulated with a SPAN tag; the first monitoring indicator and the second monitoring indicator are reported .
根据本发明实施例的链路监控方法,至少具有如下有益效果:实现第一监控探针与第二监控探针的同步,对链路和应用分别进行监控,无需引进额外的监控系统,降低了链路监控的复杂度。The link monitoring method according to the embodiment of the present invention has at least the following beneficial effects: realizing the synchronization of the first monitoring probe and the second monitoring probe, monitoring the link and the application separately, without introducing an additional monitoring system, reducing the The complexity of link monitoring.
根据本发明的一些实施例,所述第一监控探针用于监控第一监控指标,所述第一监控指标包括监控应用指标;所述第二监控探针用于监控第二监控指标,所述第二监控指标包括监控链路指标。According to some embodiments of the present invention, the first monitoring probe is used to monitor a first monitoring indicator, and the first monitoring indicator includes monitoring application indicators; the second monitoring probe is used to monitor a second monitoring indicator, so The second monitoring indicator includes monitoring link indicators.
根据本发明的一些实施例,所述链路监控方法还包括:所述链路指标在应用运行时生成,所述链路指标用于采集链路指标数据。According to some embodiments of the present invention, the link monitoring method further includes: the link indicator is generated when the application is running, and the link indicator is used to collect link indicator data.
根据本发明的一些实施例,所述链路监控方法还包括:异步上报所述第二监控指标至缓冲区;上报所述第二监控指标至服务器。According to some embodiments of the present invention, the link monitoring method further includes: asynchronously reporting the second monitoring indicator to a buffer; reporting the second monitoring indicator to a server.
根据本发明的第二方面实施例的告警方法,获取如第一方面所述的链路监控方法上报的第二监控指标;所述根据所述第二监控指标获取所述SPAN标签;根据所述第二监控指标获取所述SPAN标签;根据所述SPAN标签获取监控链路指标;判断所述监控链路指标是否超过告警阈值;若未超过告警阈值,则结束阈值告警;若超过告警阈值,则进一步判断所述监控链路指标时间是否聚合;若聚合,则获取当前链路指标数据,根据所述当前链路指标数据时间范围内数据做聚合计算,并进入参数校验流程;若不聚合,则进入参数校验流程。According to the alarm method of the embodiment of the second aspect of the present invention, the second monitoring index reported by the link monitoring method according to the first aspect is obtained; the SPAN label is obtained according to the second monitoring index; according to the The second monitoring index obtains the SPAN label; obtains the monitoring link index according to the SPAN label; judges whether the monitoring link index exceeds the alarm threshold; if it does not exceed the alarm threshold, end the threshold alarm; if it exceeds the alarm threshold, then Further determine whether the monitoring link indicator time is aggregated; if aggregated, obtain the current link indicator data, perform aggregation calculation according to the data within the current link indicator data time range, and enter the parameter verification process; if not aggregated, Then enter the parameter verification process.
根据本发明实施例的链路监控方法,至少具有如下有益效果:根据获取的链路数据获取监控链路指标,对告警信息进行处理可以保证告警数据的真实性,避免因为数据本身的错误造成AMS服务器的过载。提升了运维平台内接收到的告警数据的真实性,降低了对运维平台的要求。The link monitoring method according to the embodiment of the present invention has at least the following beneficial effects: the monitoring link index is obtained according to the obtained link data, and the alarm information is processed to ensure the authenticity of the alarm data and avoid the AMS caused by the error of the data itself. Server overload. The authenticity of the alarm data received in the operation and maintenance platform is improved, and the requirements for the operation and maintenance platform are reduced.
根据本发明的一些实施例,所述参数校验流程包括:判断参数校验是否通过,若参数校验不通过,则结束阈值告警;若参数校验通过,则根据所述参数计算表达式;判断所述表达式是否为真,若所述表达式不为真,则结束阈值告警;若所述表达式为真,则进行告警。According to some embodiments of the present invention, the parameter verification process includes: judging whether the parameter verification passes, and if the parameter verification fails, ending a threshold alarm; if the parameter verification passes, calculating an expression according to the parameter; It is judged whether the expression is true, and if the expression is not true, the threshold alarm is terminated; if the expression is true, an alarm is performed.
根据本发明的一些实施例,所述告警方法还包括:根据所述SPAN标签获取所述SPAN标签的上下文关系;根据所述SPAN标签的上下文关系生成拓扑图。According to some embodiments of the present invention, the alarm method further includes: acquiring the context relationship of the SPAN label according to the SPAN label; and generating a topology map according to the context relationship of the SPAN label.
根据本发明的第三方面实施例的链路监控系统,包括:同步模块,所述同步模块用于同步所述第一监控探针与所述第二监控探针的标识ID;监控指标获取模块,所述监控指标获取模块用于获取第一监控指标和第二监控指标;封装模块,所述封装模块用于将SPAN标签封装进所述第二监控指标;上报模块,所述上报模块用于上报所述第一监控指标和所述第二监控指标。The link monitoring system according to the embodiment of the third aspect of the present invention includes: a synchronization module configured to synchronize the identification IDs of the first monitoring probe and the second monitoring probe; a monitoring index acquisition module , the monitoring index obtaining module is used to obtain the first monitoring index and the second monitoring index; the encapsulation module is used to encapsulate the SPAN label into the second monitoring index; the reporting module is used to The first monitoring indicator and the second monitoring indicator are reported.
根据本发明实施例的链路监控系统,至少具有如下有益效果:实现对链路的监控,对链路和应用分别进行监控,根据链路设置链路监控指标,并上报至运维平台,无需引进额外的监控系统,降低了链路监控的复杂度。The link monitoring system according to the embodiment of the present invention has at least the following beneficial effects: monitoring the link, monitoring the link and the application separately, setting the link monitoring index according to the link, and reporting it to the operation and maintenance platform, without the need for The introduction of an additional monitoring system reduces the complexity of link monitoring.
根据本发明的第四方面实施例的终端,包括:存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现:执行如第一方面所述的链路监控方法;或者,如第二方面所述的告警方法。A terminal according to an embodiment of the fourth aspect of the present invention includes: a memory, a processor, and a computer program stored in the memory and executable on the processor, when the processor executes the program, it realizes: execute the program as described in the first aspect The link monitoring method; or, the alarm method according to the second aspect.
根据本发明实施例的计算机可读存储介质,至少具有如下有益效果:实现了对全链路的监控,无代码入侵应用,降低了对应用性能的影响,实现了应用与监控解耦。The computer-readable storage medium according to the embodiment of the present invention has at least the following beneficial effects: realizing the monitoring of the whole link, no code intrusion into the application, reducing the impact on the application performance, and realizing the decoupling of the application and the monitoring.
根据本发明的第五方面实施例的计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于:如第一方面所述的链路监控方法;或者,如第二方面所述的告警方法。The computer-readable storage medium according to the embodiment of the fifth aspect of the present invention stores computer-executable instructions, where the computer-executable instructions are used for: the link monitoring method described in the first aspect; or, as in the second aspect the described alarm method.
根据本发明实施例的计算机可读存储介质,至少具有如下有益效果:实现对链路的监控,同时整合应用和链路监控,对告警信息进行筛选,上报符合规范的告警信息。The computer-readable storage medium according to the embodiment of the present invention has at least the following beneficial effects: realizing link monitoring, integrating application and link monitoring, screening alarm information, and reporting alarm information that conforms to specifications.
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。Additional aspects and advantages of the present invention will be set forth, in part, from the following description, and in part will be apparent from the following description, or may be learned by practice of the invention.
附图说明Description of drawings
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of embodiments taken in conjunction with the accompanying drawings, wherein:
图1为根据本发明实施例的链路监控方法的流程示意图;1 is a schematic flowchart of a link monitoring method according to an embodiment of the present invention;
图2为根据本发明一实施例的告警方法的流程示意图;FIG. 2 is a schematic flowchart of an alarm method according to an embodiment of the present invention;
图3为根据本发明另一实施例的告警方法的流程示意图。FIG. 3 is a schematic flowchart of an alarm method according to another embodiment of the present invention.
具体实施方式Detailed ways
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。The following describes in detail the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, only used to explain the present invention, and should not be construed as a limitation of the present invention.
在本发明的描述中,需要理解的是,术语“中心”、“纵向”、“横向”、“长度”、“宽度”、“厚度”、“上”、“下”、“前”、“后”、“左”、“右”、“竖直”、“水平”、“顶”、“底”、“内”、“外”、“轴向”、“径向”、“周向”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。此外,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本发明的描述中,除非另有说明,“多个”的含义是两个或两个以上。In the description of the present invention, it should be understood that the terms "center", "longitudinal", "lateral", "length", "width", "thickness", "upper", "lower", "front", " Rear, Left, Right, Vertical, Horizontal, Top, Bottom, Inner, Outer, Axial, Radial, Circumferential The orientation or positional relationship indicated by etc. is based on the orientation or positional relationship shown in the accompanying drawings, and is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the indicated device or element must have a specific orientation, with a specific orientation. The orientation configuration and operation are therefore not to be construed as limitations of the present invention. Furthermore, features delimited with "first", "second" may expressly or implicitly include one or more of that feature. In the description of the present invention, unless otherwise specified, "plurality" means two or more.
在本发明的描述中,需要说明的是,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明中的具体含义。In the description of the present invention, it should be noted that the terms "installed", "connected" and "connected" should be understood in a broad sense, unless otherwise expressly specified and limited, for example, it may be a fixed connection or a detachable connection Connection, or integral connection; can be mechanical connection, can also be electrical connection; can be directly connected, can also be indirectly connected through an intermediate medium, can be internal communication between two elements. For those of ordinary skill in the art, the specific meanings of the above terms in the present invention can be understood in specific situations.
下面参考图1和描述根据本发明实施例的链路监控方法。The link monitoring method according to the embodiment of the present invention is described below with reference to FIG. 1 .
如图1,根据本发明实施例的链路监控方法,至少包括以下步骤:As shown in FIG. 1, a link monitoring method according to an embodiment of the present invention at least includes the following steps:
S100:获取第一监控探针的标识ID。S100: Obtain the identification ID of the first monitoring probe.
在一些实施例中,获取第一监控探针的识别ID,第一监控探针的识别ID与AMS运维平台的识别ID不相同。In some embodiments, the identification ID of the first monitoring probe is obtained, and the identification ID of the first monitoring probe is different from the identification ID of the AMS operation and maintenance platform.
S200:同步第二监控探针。S200: Synchronize the second monitoring probe.
在一些实施例中,同步第二监控探针,将第二监控探针的标识ID设置为第一监控探针的标识ID。第二监控探针可以是Skywalking agent。同步标识ID可以实现对AMS运维平台和基于OpenTrace监控的整合。In some embodiments, the second monitoring probe is synchronized, and the identification ID of the second monitoring probe is set as the identification ID of the first monitoring probe. The second monitoring probe may be a Skywalking agent. Synchronizing the ID can realize the integration of the AMS operation and maintenance platform and monitoring based on OpenTrace.
S300:获取第一监控指标和第二监控指标。S300: Obtain the first monitoring index and the second monitoring index.
在一些实施例中,获取第一监控指标和第二监控指标。第一监控指标包括应用指标,第二监控指标包括链路指标。In some embodiments, the first monitoring indicator and the second monitoring indicator are obtained. The first monitoring indicators include application indicators, and the second monitoring indicators include link indicators.
在一些实施例中,获取第一链路数据,并根据第一链路数据获取第一监控指标和第二监控指标。In some embodiments, the first link data is acquired, and the first monitoring indicator and the second monitoring indicator are acquired according to the first link data.
S600:上报第一监控指标和第二监控指标。S600: Report the first monitoring indicator and the second monitoring indicator.
在一些实施例中,上报第二监控指标时的通信框架包括使用gRPC框架进行数据传输。使用gRPC框架进行通信可以协同Protobuf,在实现大量数据传输的同时,不影响应用的性能。使用gPRC可以实现不同平台和语言之间的数据传输,满足分布式系统的需求。In some embodiments, the communication framework when reporting the second monitoring indicator includes using the gRPC framework for data transmission. Communication using the gRPC framework can cooperate with Protobuf to achieve a large amount of data transmission without affecting the performance of the application. Using gPRC can realize data transfer between different platforms and languages to meet the needs of distributed systems.
在一些实施例中,第一监控指标的获取方式包括使用应用性能监控系统Skywalking,Skywalking架构内包含Skywalking agent,Skywalking agent内封装有JavaAgent。Java Agent在JVM加载类和类之中的方法时,修改加载的类文件。Java Agent用于协助监测、运行其他JVM上的应用。In some embodiments, the acquisition method of the first monitoring indicator includes using an application performance monitoring system Skywalking, the Skywalking framework includes a Skywalking agent, and the Skywalking agent is packaged with a JavaAgent. The Java Agent modifies the loaded class file when the JVM loads the class and the methods in the class. Java Agent is used to assist in monitoring and running applications on other JVMs.
在一些实施例中,使用Protocol Buffers即Protobuf对第二监控指标及SPAN标签进行处理。使用Protobuf可以对数据进行序列化处理,将数据转换成能够被存储和传输的格式,同时这个序列化结果能够被重建回原来的数据。使用Protobuf进行数据处理可以支持跨平台和语言的数据处理,使数据可以在分布式平台进行传输,并能够进行解码。使用Protobuf可以将数据序列化为二进制编码,这会大幅减少需要传输的数据量,减少需要传输的数据量,提升数据传输的效率。In some embodiments, Protocol Buffers, namely Protobuf, are used to process the second monitoring indicator and the SPAN tag. Using Protobuf, data can be serialized, converted into a format that can be stored and transmitted, and the serialized result can be reconstructed back to the original data. Using Protobuf for data processing can support data processing across platforms and languages, so that data can be transmitted and decoded on distributed platforms. Using Protobuf can serialize data into binary code, which will greatly reduce the amount of data that needs to be transmitted, reduce the amount of data that needs to be transmitted, and improve the efficiency of data transmission.
在一些实施例中,使用Java Agent在方法执行前后增加链路指标采集逻辑。In some embodiments, a Java Agent is used to add link indicator collection logic before and after the method is executed.
在一些实施例中,第二监控指标上报至AMS运维监控平台,AMS运维监控平台可以根据监控链路指标进行对链路进行监控。In some embodiments, the second monitoring indicator is reported to the AMS operation and maintenance monitoring platform, and the AMS operation and maintenance monitoring platform can monitor the link according to the monitoring link indicator.
在一些实施例中,使用Protobuf可以将数据序列化为二进制编码,这会大幅减少需要传输的数据量,提升数据传输的效率。同时,使用Protobuf可以降低对应用的影响,提升运维平台的性能。In some embodiments, using Protobuf can serialize data into binary codes, which greatly reduces the amount of data to be transmitted and improves the efficiency of data transmission. At the same time, using Protobuf can reduce the impact on the application and improve the performance of the operation and maintenance platform.
在一些实施例中,获取第一监控探针的标识ID;同步第二监控探针,将第二监控探针的标识ID设置为第一监控探针的标识ID;获取第一监控指标和第二监控指标;获取第一链路数据;获取第一监控指标和第二监控指标,第二监控指标封装有SPAN标签;上报第一监控指标和第二监控指标。In some embodiments, the identification ID of the first monitoring probe is obtained; the second monitoring probe is synchronized, and the identification ID of the second monitoring probe is set as the identification ID of the first monitoring probe; the first monitoring indicator and the first monitoring probe are obtained. Two monitoring indicators; acquiring first link data; acquiring first monitoring indicators and second monitoring indicators, the second monitoring indicators being encapsulated with SPAN tags; reporting the first monitoring indicators and the second monitoring indicators.
在一些实施例中,第一监控探针用于监控第一监控指标,第一监控指标包括监控应用指标;第二监控探针用于监控第二监控指标,第二监控指标包括监控链路指标。In some embodiments, the first monitoring probe is used to monitor the first monitoring indicator, and the first monitoring indicator includes monitoring application indicators; the second monitoring probe is used to monitor the second monitoring indicator, and the second monitoring indicator includes monitoring link indicators .
在一些实施例中,SPAN标签根据监控链路指标设置。In some embodiments, the SPAN tag is set according to monitoring link metrics.
在一些实施例中,使用Java Agent在方法执行前后增加监控链路指标采集逻辑。应用运行驱动监控链路指标的产生,根据采集逻辑采集到的监控链路指标进行上报。In some embodiments, a Java Agent is used to add monitoring link indicator collection logic before and after the method is executed. The application operation drives the generation of monitoring link indicators, and reports the monitoring link indicators collected by the collection logic.
在一些实施例中,针对应用模块编写监控插件,监控插件基于Skywalking Agent,可以实现对应用的无代码入侵。同时,基于不同的应用设置不同的监控插件,可以实现对链路的差异化监控。In some embodiments, a monitoring plug-in is written for the application module, and the monitoring plug-in is based on the Skywalking Agent, which can implement codeless intrusion to the application. At the same time, different monitoring plug-ins can be set based on different applications to realize differentiated monitoring of links.
在一些实施例中,第一监控探针用于监控第一监控指标,第一监控指标包括监控应用指标;第二监控探针用于监控第二监控指标,第一监控指标包括监控链路指标。In some embodiments, the first monitoring probe is used to monitor the first monitoring indicator, and the first monitoring indicator includes monitoring application indicators; the second monitoring probe is used to monitor the second monitoring indicator, and the first monitoring indicator includes monitoring link indicators .
在一些实施例中,应用指标包括应用模块指标,用于监控应用模块的状态。In some embodiments, application metrics include application module metrics for monitoring the status of application modules.
在一些实施例中,监控链路指标包括trace指标,用于监控链路的状态。In some embodiments, monitoring link metrics includes trace metrics for monitoring the status of the link.
在一些实施例中,监控链路指标可以根据应用的不同进行设置,实现对链路指标的灵活监控。In some embodiments, the monitoring link indicator can be set according to different applications, so as to realize flexible monitoring of the link indicator.
在一些实施例中,第二监控指标数据根据开放分布式追踪OpenTrace规范进行封装,开放分布式追踪OpenTrace用于在不同的平台和系统进行数据追踪。In some embodiments, the second monitoring indicator data is encapsulated according to the open distributed tracing OpenTrace specification, and the open distributed tracing OpenTrace is used for data tracing on different platforms and systems.
在一些实施例中,监控链路指标在应用运行时生成,监控链路指标用于在链路传输时采集指标数据。In some embodiments, the monitoring link indicator is generated when the application is running, and the monitoring link indicator is used to collect indicator data during link transmission.
如图2所示,根据本发明实施例的告警方法至少包括以下步骤:As shown in FIG. 2 , the alarm method according to the embodiment of the present invention includes at least the following steps:
S701:根据第二链路数据获取SPAN标签。S701: Acquire a SPAN label according to the second link data.
在一些实施例中,解析第二链路数据,获取封装的SPAN标签。第二链路数据根据开放分布式追踪OpenTrace规范进行封装,因此可以在不同的平台进行解析。In some embodiments, the second link data is parsed to obtain the encapsulated SPAN tag. The second link data is encapsulated according to the open distributed tracing OpenTrace specification, so it can be parsed on different platforms.
S702:根据SPAN标签获取监控链路指标。S702: Acquire monitoring link indicators according to the SPAN label.
在一些实施例中,根据SPAN标签的内容获取监控链路指标,链路指标根据应用的不同进行设置,并在应用运行的过程中产生和上报。In some embodiments, the monitoring link indicator is acquired according to the content of the SPAN tag, the link indicator is set according to different applications, and is generated and reported during the running of the application.
S703:判断监控链路指标是否超过告警阈值。S703: Determine whether the monitoring link indicator exceeds an alarm threshold.
在一些实施例中,判断监控链路指标中的数值是否超过告警阈值,告警阈值可以根据不同的链路指标进行设置,同时还可以根据应用的不同和需求进行设置。In some embodiments, it is determined whether the value in the monitored link indicator exceeds an alarm threshold, and the alarm threshold can be set according to different link indicators, and can also be set according to different applications and requirements.
S704:判断监控链路指标时间是否聚合。S704: Determine whether the monitoring link indicator time is aggregated.
在一些实施例中,判断监控链路指标时间是否聚合,若监控链路指标时间聚合则进行聚合计算,若监控链路指标时间不聚合,则对监控链路指标进行参数校验流程,避免AMS运维平台因获取错误的数据而发出错误告警。In some embodiments, it is determined whether the monitoring link indicator time is aggregated, and if the monitoring link indicator time is aggregated, the aggregation calculation is performed. If the monitoring link indicator time is not aggregated, the parameter verification process is performed on the monitoring link indicator to avoid AMS. The operation and maintenance platform issues an error alarm due to the acquisition of wrong data.
S705:获取当前链路指标数据。S705: Acquire current link indicator data.
在一些实施例中,获取当前链路指标数据,以进行监控链路指标时间的聚合。In some embodiments, current link indicator data is obtained for aggregation of monitoring link indicator times.
S706:当前链路指标数据时间范围内数据做聚合计算。S706: Perform aggregation calculation on the data within the current link indicator data time range.
在一些实施例中,根据当前链路指标数据时间范围内数据做聚合计算,以获取校验参数。In some embodiments, the aggregation calculation is performed according to the data in the current link indicator data time range to obtain the verification parameter.
S707:结束阈值告警。S707: End the threshold alarm.
在一些实施例中,若监控链路指标中的数值是否超过告警阈值未超过告警阈值,则结束阈值告警流程。避免因频繁的告警而影响运维系统的性能和稳定性。In some embodiments, if the value in the monitored link indicator exceeds the alarm threshold but does not exceed the alarm threshold, the threshold alarm process ends. Avoid affecting the performance and stability of the operation and maintenance system due to frequent alarms.
S800:参数校验流程。S800: Parameter verification process.
在一些实施例中,当监控链路指标时间聚合,或者经过处理后时间聚合,则进行参数校验流程。In some embodiments, when the monitored link indicators are aggregated over time, or aggregated over time after processing, a parameter verification process is performed.
在一些实施例中,链路监控方法还包括:根据第二监控数据获取SPAN标签;根据SPAN标签获取监控链路指标;判断监控链路指标是否超过告警阈值;若未超过告警阈值,则结束阈值告警;若超过告警阈值,则进一步判断监控链路指标时间是否聚合;若聚合,则获取当前链路指标数据,根据当前链路指标数据时间范围内数据做聚合计算,并进入参数校验流程;若不聚合,则进入参数校验流程。In some embodiments, the link monitoring method further includes: obtaining a SPAN label according to the second monitoring data; obtaining a monitoring link indicator according to the SPAN label; judging whether the monitoring link indicator exceeds an alarm threshold; if it does not exceed the alarm threshold, ending the threshold Alarm; if it exceeds the alarm threshold, it is further judged whether the monitoring link indicator time is aggregated; if it is aggregated, the current link indicator data is obtained, the aggregation calculation is performed according to the data within the current link indicator data time range, and the parameter verification process is entered; If not aggregated, enter the parameter verification process.
如图3所示,根据本发明实施例的告警方法至少包括以下步骤:As shown in FIG. 3 , the alarm method according to the embodiment of the present invention includes at least the following steps:
S801:判断参数校验是否通过。S801: Judge whether the parameter verification is passed.
在一些实施例中,判断参数校验是否通过包括当监控链路指标时间聚合时对聚合计算后的结果校验和当监控链路指标时间不聚合时对参数进行校验。In some embodiments, judging whether the parameter verification pass includes verifying the result after aggregation when the monitoring link indicators are aggregated in time and verifying the parameters when the monitoring link indicators are not aggregated in time.
S802:根据参数计算表达式。S802: Calculate the expression according to the parameter.
在一些实施例中,表达式根据Avaitor进行计算。In some embodiments, the expression is evaluated according to Avaitor.
S803:判断表达式是否为真。S803: Determine whether the expression is true.
在一些实施例中,判断由Avaitor计算得出的表达式是否为真,若未真,则判定需要告警,若不为真,则判定不需要告警。In some embodiments, it is determined whether the expression calculated by Avaitor is true, if not, it is determined that an alarm is required, and if not, it is determined that an alarm is not required.
S804:进行告警。S804: Generate an alarm.
在一些实施例中,若表达式为真,则进行告警,由于告警前经过数据校验,告警数据的真实性提高,可以降低AMS运维平台的运行压力。In some embodiments, if the expression is true, an alarm is performed. Since the data is verified before the alarm, the authenticity of the alarm data is improved, which can reduce the operation pressure of the AMS operation and maintenance platform.
S707:结束阈值告警。S707: End the threshold alarm.
在一些实施例中,若表达式不为真或参数校验未通过,则结束阈值告警,避免因参数错误而造成错误告警。In some embodiments, if the expression is not true or the parameter verification fails, the threshold alarm is terminated to avoid false alarms caused by incorrect parameters.
在一些实施例中,参数校验流程包括:判断参数校验是否通过,若参数校验不通过,则结束阈值告警;若参数校验通过,则根据参数计算表达式;判断表达式是否为真,若表达式不为真,则结束阈值告警;若表达式为真,则进行告警。In some embodiments, the parameter verification process includes: judging whether the parameter verification passes, and if the parameter verification fails, ending a threshold alarm; if the parameter verification passes, calculating an expression according to the parameter; judging whether the expression is true , if the expression is not true, the threshold alarm is terminated; if the expression is true, the alarm is performed.
在一些实施例中,使用参数校验流程可以保证告警数据的真实性,避免因为数据本身的错误造成AMS服务器的过载。提升了运维平台内接收到的告警数据的真实性,降低了对运维平台的要求。In some embodiments, the use of the parameter verification process can ensure the authenticity of the alarm data, and avoid overloading the AMS server due to errors in the data itself. The authenticity of the alarm data received in the operation and maintenance platform is improved, and the requirements for the operation and maintenance platform are reduced.
根据本发明实施例的链路监控系统,包括:同步模块,同步模块用于同步第一监控探针与第二监控探针的标识ID;监控指标获取模块,监控指标获取模块用于获取第一监控指标和第二监控指标;链路获取模块,链路获取模块用于获取第一链路数据;封装模块,封装模块用于将SPAN标签封装进第一链路数据形成第二链路数据;上报模块,上报模块用于上报第一监控指标和第二链路数据。A link monitoring system according to an embodiment of the present invention includes: a synchronization module for synchronizing the identification IDs of the first monitoring probe and the second monitoring probe; a monitoring indicator acquisition module for acquiring the first monitoring indicator Monitoring indicators and second monitoring indicators; a link acquisition module, the link acquisition module is used to acquire the first link data; the encapsulation module, the encapsulation module is used to encapsulate the SPAN label into the first link data to form the second link data; A reporting module, where the reporting module is used to report the first monitoring indicator and the second link data.
根据本发明实施例的终端,包括:存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行程序时实现:如上述的链路监控方法;或者,如上述的告警方法。A terminal according to an embodiment of the present invention includes: a memory, a processor, and a computer program stored in the memory and running on the processor. When the processor executes the program, the processor implements: the link monitoring method described above; or, as described above Alert method.
在一些实施例中,终端设备可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。上述处理器可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等,处理器是上述链路监控系统的管理终端设备的控制中心,利用各种接口和线路连接整个链路监控系统的管理终端设备的各个部分。In some embodiments, the terminal device may be a computing device such as a desktop computer, a notebook computer, a palmtop computer, and a cloud server. The above-mentioned processor may be a central processing unit (Central Processing Unit, CPU), and may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), off-the-shelf processors Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general-purpose processor can be a microprocessor or the processor can also be any conventional processor, etc. The processor is the control center of the management terminal equipment of the above-mentioned link monitoring system, and uses various interfaces and lines to connect the entire link monitoring system. to manage various parts of the terminal device.
在一些实施例中,存储器可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序等;存储数据区可存储根据手机的使用所创建的数据等。此外,存储器可以包括高速随机存取存储器,还可以包括非易失性存储器,例如硬盘、内存、插接式硬盘,智能存储卡(Smart Media Card, SMC),安全数字(SecureDigital, SD)卡,闪存卡(Flash Card)、至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。In some embodiments, the memory may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function, and the like; the storage data area may store data created according to the use of the mobile phone. data etc. In addition, the memory may include high-speed random access memory, and may also include non-volatile memory such as hard disk, internal memory, plug-in hard disk, Smart Media Card (SMC), Secure Digital (SD) card, Flash Card, at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
根据本发明实施例的计算机可读存储介质,存储有计算机可执行指令,计算机可执行指令用于:执行如上述的链路监控方法;或者,执行如上述的告警方法。The computer-readable storage medium according to the embodiment of the present invention stores computer-executable instructions, where the computer-executable instructions are used to: execute the above-mentioned link monitoring method; or, execute the above-mentioned alarm method.
在一些实施例中,计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,计算机程序包括计算机程序代码,计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。计算机可读介质可以包括:能够携带计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。In some embodiments, a computer program may be stored in a computer-readable storage medium, and when executed by a processor, the computer program may implement the steps of the above-described method embodiments. Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file or some intermediate form. Computer-readable media may include: any entity or device capable of carrying computer program code, recording media, USB flash drives, removable hard disks, magnetic disks, optical discs, computer memory, read-only memory (ROM, Read-Only Memory), random access Memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in the computer-readable media may be appropriately increased or decreased as required by the legislation and patent practice in the jurisdiction. For example, in some jurisdictions, according to the legislation and patent practice, the computer-readable media does not include Electrical carrier signals and telecommunication signals.
以上所描述的装置实施例仅仅是示意性的,其中作为分离部件说明的单元可以是或者也可以不是物理上分开的,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The apparatus embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示意性实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, reference to the terms "one embodiment," "some embodiments," "exemplary embodiment," "example," "specific example," or "some examples", etc., is meant to incorporate the embodiments A particular feature, structure, material, or characteristic described by an example or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
尽管已经示出和描述了本发明的实施例,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010649744.1ACN111541590B (en) | 2020-07-08 | 2020-07-08 | Link monitoring method, alarm method, system, terminal and storage medium |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010649744.1ACN111541590B (en) | 2020-07-08 | 2020-07-08 | Link monitoring method, alarm method, system, terminal and storage medium |
| Publication Number | Publication Date |
|---|---|
| CN111541590Atrue CN111541590A (en) | 2020-08-14 |
| CN111541590B CN111541590B (en) | 2020-11-06 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010649744.1AActiveCN111541590B (en) | 2020-07-08 | 2020-07-08 | Link monitoring method, alarm method, system, terminal and storage medium |
| Country | Link |
|---|---|
| CN (1) | CN111541590B (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112035317A (en)* | 2020-08-28 | 2020-12-04 | 北京浪潮数据技术有限公司 | Micro-service link monitoring method, device, equipment and medium |
| CN112118153A (en)* | 2020-09-06 | 2020-12-22 | 苏州浪潮智能科技有限公司 | A link monitoring method and system based on grpc and spring mvc |
| CN112286769A (en)* | 2020-11-20 | 2021-01-29 | 成都完美天智游科技有限公司 | Monitoring method, monitoring device, computer equipment and storage medium |
| CN114697232A (en)* | 2020-12-25 | 2022-07-01 | 中国移动通信集团福建有限公司 | Skywalking probe index data acquisition system and method and electronic equipment |
| CN115495319A (en)* | 2022-10-24 | 2022-12-20 | 康键信息技术(深圳)有限公司 | Service link monitoring method and device, storage medium and computer equipment |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020042830A1 (en)* | 2000-03-31 | 2002-04-11 | Subhra Bose | System, method and applications real-time messaging over HTTP-based protocols |
| CN103997432A (en)* | 2013-07-31 | 2014-08-20 | 中国人民解放军理工大学 | Measuring system and method for supporting analysis of OpenFlow application performance |
| CN108628719A (en)* | 2018-04-19 | 2018-10-09 | 杭州朗和科技有限公司 | Using the acquisition methods of data, medium, device and computing device |
| CN109818823A (en)* | 2018-12-29 | 2019-05-28 | 北京奥鹏远程教育中心有限公司 | Full link trace system |
| CN110535902A (en)* | 2019-07-16 | 2019-12-03 | 平安普惠企业管理有限公司 | The processing method and electronic device of monitoring intelligent are realized based on micro services |
| CN110611715A (en)* | 2019-09-23 | 2019-12-24 | 国云科技股份有限公司 | System and method for collecting cloud monitoring information by service link |
| CN110688277A (en)* | 2019-09-12 | 2020-01-14 | 上海易点时空网络有限公司 | Data monitoring method and device for micro-service framework |
| CN110888780A (en)* | 2019-11-19 | 2020-03-17 | 泰康保险集团股份有限公司 | Application monitoring method, device, equipment and storage medium |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020042830A1 (en)* | 2000-03-31 | 2002-04-11 | Subhra Bose | System, method and applications real-time messaging over HTTP-based protocols |
| CN103997432A (en)* | 2013-07-31 | 2014-08-20 | 中国人民解放军理工大学 | Measuring system and method for supporting analysis of OpenFlow application performance |
| CN108628719A (en)* | 2018-04-19 | 2018-10-09 | 杭州朗和科技有限公司 | Using the acquisition methods of data, medium, device and computing device |
| CN109818823A (en)* | 2018-12-29 | 2019-05-28 | 北京奥鹏远程教育中心有限公司 | Full link trace system |
| CN110535902A (en)* | 2019-07-16 | 2019-12-03 | 平安普惠企业管理有限公司 | The processing method and electronic device of monitoring intelligent are realized based on micro services |
| CN110688277A (en)* | 2019-09-12 | 2020-01-14 | 上海易点时空网络有限公司 | Data monitoring method and device for micro-service framework |
| CN110611715A (en)* | 2019-09-23 | 2019-12-24 | 国云科技股份有限公司 | System and method for collecting cloud monitoring information by service link |
| CN110888780A (en)* | 2019-11-19 | 2020-03-17 | 泰康保险集团股份有限公司 | Application monitoring method, device, equipment and storage medium |
| Title |
|---|
| 石爱武;王辉: "基于微服务架构的石油业务流程共享中心应用探讨", 《石化技术》* |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112035317A (en)* | 2020-08-28 | 2020-12-04 | 北京浪潮数据技术有限公司 | Micro-service link monitoring method, device, equipment and medium |
| CN112035317B (en)* | 2020-08-28 | 2022-06-17 | 北京浪潮数据技术有限公司 | Micro-service link monitoring method, device, equipment and medium |
| CN112118153A (en)* | 2020-09-06 | 2020-12-22 | 苏州浪潮智能科技有限公司 | A link monitoring method and system based on grpc and spring mvc |
| CN112118153B (en)* | 2020-09-06 | 2022-12-27 | 苏州浪潮智能科技有限公司 | Grpc and spring mvc-based link monitoring method and system |
| CN112286769A (en)* | 2020-11-20 | 2021-01-29 | 成都完美天智游科技有限公司 | Monitoring method, monitoring device, computer equipment and storage medium |
| CN112286769B (en)* | 2020-11-20 | 2023-10-20 | 成都完美天智游科技有限公司 | Monitoring method, monitoring device, computer equipment and storage medium |
| CN114697232A (en)* | 2020-12-25 | 2022-07-01 | 中国移动通信集团福建有限公司 | Skywalking probe index data acquisition system and method and electronic equipment |
| CN114697232B (en)* | 2020-12-25 | 2024-02-23 | 中国移动通信集团福建有限公司 | System and method for acquiring index data of Skywalk probe and electronic equipment |
| CN115495319A (en)* | 2022-10-24 | 2022-12-20 | 康键信息技术(深圳)有限公司 | Service link monitoring method and device, storage medium and computer equipment |
| Publication number | Publication date |
|---|---|
| CN111541590B (en) | 2020-11-06 |
| Publication | Publication Date | Title |
|---|---|---|
| CN111541590B (en) | Link monitoring method, alarm method, system, terminal and storage medium | |
| CN106878064B (en) | Data monitoring method and device | |
| CN111459782B (en) | Method and device for monitoring service system, cloud platform system and server | |
| CN112994955A (en) | Upgrade package sending management method, incremental upgrade package preparation method and related device | |
| CN114500690A (en) | Interface data processing method and device, electronic equipment and storage medium | |
| CN114357001A (en) | Multi-cluster data query method, device, monitoring platform and storage medium | |
| CN111506641B (en) | Data management method, data acquisition platform, data management system and storage medium | |
| CN110908910B (en) | Block chain-based test monitoring method and device and readable storage medium | |
| CN115865645A (en) | Link determination method, device, equipment, medium and product | |
| CN116016653A (en) | Information pushing method and device of blockchain, electronic equipment and storage medium | |
| CN108512675A (en) | A kind of method, apparatus of network diagnosis, control node and network node | |
| CN114416420A (en) | Equipment problem feedback method and system | |
| CN114239296A (en) | Data processing method and device and electronic equipment | |
| CN111107143B (en) | Network file transmission detection method, device and system | |
| CN113824755A (en) | Method, system and related device for blockchain data processing | |
| CN120011317A (en) | Data verification method, electronic equipment, device and storage medium | |
| US10467082B2 (en) | Device driver verification | |
| CN115511316A (en) | Interface model determining method, device and readable storage medium | |
| CN115061891A (en) | System load capacity prediction method and device based on block chain | |
| CN114416744A (en) | Identification data generation method, device, computer equipment and storage medium | |
| CN111949474A (en) | Message monitoring method, warning method, system, terminal and storage medium | |
| CN113791922B (en) | Exception handling method, system and device for distributed storage system | |
| CN113656324B (en) | Full-link test method, device, equipment and medium for disease input and decision | |
| CN119127545B (en) | Fault processing method, device and equipment | |
| CN114386838B (en) | Indicator configuration method, device, computer equipment and computer readable storage medium |
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CP02 | Change in the address of a patent holder | ||
| CP02 | Change in the address of a patent holder | Address after:510308 room 23bd, No. 109, TIYU West Road, Tianhe District, Guangzhou City, Guangdong Province Patentee after:GUANGZHOU XUANWU WIRELESS TECHNOLOGY Co.,Ltd. Address before:510620 unit 01, 32F, building B, Victoria Plaza, 103 TIYU West Road, Tianhe District, Guangzhou City, Guangdong Province Patentee before:GUANGZHOU XUANWU WIRELESS TECHNOLOGY Co.,Ltd. | |
| PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
| PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention:Link monitoring methods, alarm methods, systems, terminals, and storage media Granted publication date:20201106 Pledgee:Bank of China Limited Dongshan Branch of Guangzhou Pledgor:GUANGZHOU XUANWU WIRELESS TECHNOLOGY Co.,Ltd. Registration number:Y2025980013479 |