JP2009252050A

Movatterモバイル変換

Info

Publication number: JP2009252050A
Application number: JP2008100846A
Authority: JP
Inventors: Teruyuki Baba; 輝幸馬場; Atsuhiro Tanaka; 淳裕田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2008-04-08
Filing date: 2008-04-08
Publication date: 2009-10-29

Abstract

<P>PROBLEM TO BE SOLVED: To appropriately determine server loads even when post-processing occurs in a computer system composed of a plurality of servers. <P>SOLUTION: A server load management system managing the loads of the plurality of servers includes a storage device storing a performance index variation table showing a time-series variation of a performance index of each server; and a processor calculating the server loads by referring to the table. The index includes a CPU-use ratio and a request index giving an index of the number of requests in processing. The processor has a tendency determination unit and a load calculation unit. The tendency determination unit determines whether the number of requests in processing is in an increase tendency or in a decrease tendency with respect to each of the plurality of servers on the basis of the time-series variation of the request index. A load calculation unit corrects the CPU-use ratio on the basis of the determination result to calculate the corrected CPU-use ratio as the server load. If the determination result is in the increase tendency, the CPU-use ratio is revised upward, and if the determination result is in the decrease tendency, the CPU-use ratio is revised downward. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

Translated fromJapanese

本発明は、複数のサーバの負荷を管理するための技術に関する。特に、本発明は、後処理を考慮して複数のサーバの負荷を管理するための技術に関する。 The present invention relates to a technique for managing loads of a plurality of servers. In particular, the present invention relates to a technique for managing loads of a plurality of servers in consideration of post-processing.

複数のサーバから構成されるコンピュータシステムにおいて、各サーバの負荷を把握することは、サーバ管理、サーバへのリクエストの割り当て、サーバ負荷の分散などの観点から重要である。計算機システムにおける負荷分散に関連する技術として、次のものが知られている。 In a computer system composed of a plurality of servers, grasping the load on each server is important from the viewpoints of server management, allocation of requests to servers, distribution of server loads, and the like. The following are known as techniques related to load distribution in a computer system.

特許文献１（特開２００１−１０９６３８号公報）には、推定伸長率に基づくトランザクション負荷分散方法が記載されている。この方法によれば、一定時間ごとに、各計算機の処理中トランザクション数、およびＣＰＵ系に滞在するプロセス数またはＣＰＵ使用率が測定される。次に、これらの測定値を元に、各計算機における処理時間の伸長率が推定される。次に、この推定伸長率に基づいて、各計算機の負荷指標の値が求められる。そして、求められた負荷指標に基づいて、負荷の低い計算機へトランザクションの配分が行われる。 Patent Document 1 (Japanese Patent Application Laid-Open No. 2001-109638) describes a transaction load distribution method based on an estimated expansion rate. According to this method, the number of transactions being processed in each computer and the number of processes staying in the CPU system or the CPU usage rate are measured at regular intervals. Next, based on these measured values, the expansion rate of the processing time in each computer is estimated. Next, based on this estimated expansion rate, the value of the load index of each computer is obtained. Then, based on the obtained load index, the transaction is distributed to a computer with a low load.

特許文献２（特開２０００−１１２９０９号公報）には、負荷分散管理システムが記載されている。処理履歴保存部は、各処理要求に対する処理履歴からなる処理履歴情報を記憶装置に保存する。処理先決定部は、処理要求元クライアントコンピュータからの処理要求が発生した場合に、当該処理要求に対する処理先サーバコンピュータを、処理履歴情報に基づいて決定する。処理要求管理部は、処理要求を処理先サーバコンピュータに転送し、当該処理先サーバコンピュータによる処理結果を当該処理要求元クライアントコンピュータに転送し、当該処理結果を反映した処理履歴を処理履歴保存部に転送する。 Japanese Patent Application Laid-Open No. 2000-112909 describes a load distribution management system. The processing history storage unit stores processing history information including processing history for each processing request in the storage device. When a processing request from a processing request source client computer occurs, the processing destination determination unit determines a processing destination server computer for the processing request based on the processing history information. The processing request management unit transfers the processing request to the processing destination server computer, transfers the processing result by the processing destination server computer to the processing request source client computer, and stores the processing history reflecting the processing result in the processing history storage unit. Forward.

特許文献３（特開２００１−７５９３４号公報）には、負荷分散処理システムが記載されている。負荷レポート送信手段は、処理実行本体の負荷が高くなると負荷通知の発信間隔を長くし、負荷が低くなると発信間隔を短くする。負荷レポート受付手段は、負荷通知の受信間隔の長さから負荷率を算出し記録する。処理要求制御手段は、処理要求を受け付けて、選択された処理実行本体へ送信する。負荷調整手段は、負荷率を処理割り当て時の時刻で補正し、補正後の負荷率が最小な処理実行本体を選択する。 Patent Document 3 (Japanese Patent Laid-Open No. 2001-75934) describes a load distribution processing system. The load report transmission means lengthens the transmission interval of the load notification when the load of the process execution main body is high, and shortens the transmission interval when the load is low. The load report receiving means calculates and records the load factor from the length of the load notification reception interval. The processing request control means receives the processing request and transmits it to the selected processing execution body. The load adjustment unit corrects the load factor at the time when the process is allocated, and selects a process execution body having the smallest corrected load factor.

特許文献４（特開２００５−２８４６９５号公報）には、情報要求装置からネットワークを介してなされる情報要求に応じて生じる情報提供装置の負荷を分散する負荷分散装置が記載されている。資源情報収集手段は、情報提供装置の１または２以上の資源の使用情報を収集する。情報要求頻度操作手段は、情報要求装置から情報提供装置への情報要求頻度を操作する。反応係数算出手段は、操作による情報要求頻度の変化分と、操作した情報要求頻度に応じて動作した情報提供装置における使用情報の変化分とから、反応係数を算出する。 Japanese Patent Application Laid-Open No. 2005-284695 describes a load distribution device that distributes the load of an information providing device generated in response to an information request made from an information request device via a network. The resource information collecting means collects usage information of one or more resources of the information providing apparatus. The information request frequency operating means operates the information request frequency from the information requesting device to the information providing device. The reaction coefficient calculation means calculates a reaction coefficient from the change in the information request frequency due to the operation and the change in the usage information in the information providing apparatus operated according to the operated information request frequency.

特許文献５（特開２００６−２５９８１２号公報）には、複数の計算機のキューに処理要求を振り分けて負荷分散を動的に行う動的キュー負荷分散方法が記載されている。その方法は、（１）計算機の負荷のレベルを示す負荷レベルを取得するステップと、（２）キュー管理テーブルのキューの長さを負荷レベルに対応するキューの長さへ変更するステップと、（３）キュー管理テーブル中のキューの長さ及び要求停滞数からキューの使用率を算出するステップと、（４）算出したキューの使用率の情報を通信装置により各計算機間で交換するステップと、（５）交換したキューの使用率に応じて処理要求を振り分けるステップと、を有する。 Patent Document 5 (Japanese Patent Laid-Open No. 2006-259812) describes a dynamic queue load distribution method in which processing requests are distributed to queues of a plurality of computers and load distribution is performed dynamically. The method includes (1) acquiring a load level indicating a load level of a computer, (2) changing a queue length of the queue management table to a queue length corresponding to the load level, and ( 3) a step of calculating the queue usage rate from the queue length and the number of request stagnation in the queue management table, and (4) a step of exchanging information on the calculated queue usage rate between the computers using a communication device; (5) a step of allocating processing requests according to the usage rate of the exchanged queue.

特許文献６（特開平１１−１１０３６０号公報）には、プロキシサーバ装置によるトランザクション分配方法が記載されている。プロキシサーバ装置は、過去のトランザクションサービスと応答時間の履歴から、トランザクション応答時間の分布を導出する。その導出された分布から、複数のサーバ装置の最大応答待ち時間がそれぞれ動的に決定される。プロキシサーバ装置は、決定された最大応答待ち時間に応じて、クライアント装置からのトランザクションリクエストをいずれかのサーバ装置に分配する。 Patent Document 6 (Japanese Patent Laid-Open No. 11-110360) describes a transaction distribution method by a proxy server device. The proxy server device derives a distribution of transaction response times from the past transaction service and response time history. From the derived distribution, the maximum response waiting times of the plurality of server devices are each dynamically determined. The proxy server device distributes the transaction request from the client device to one of the server devices according to the determined maximum response waiting time.

特開２００１−１０９６３８号公報Japanese Patent Laid-Open No. 2001-109638特開２０００−１１２９０９号公報JP 2000-112909 A特開２００１−７５９３４号公報JP 2001-75934 A特開２００５−２８４６９５号公報JP 2005-284695 A特開２００６−２５９８１２号公報JP 2006-259812 A特開平１１−１１０３６０号公報JP-A-11-110360

最近、コンピュータシステム上で動作するアプリケーションの中には「後処理」を行うものが多い。後処理とは、サーバがリクエストを処理し、ユーザあるいは前段階のサーバにレスポンスを返した後に、その処理に関連して行う処理のことである。 Recently, many applications running on computer systems perform “post-processing”. Post-processing is processing performed in connection with the processing after the server processes the request and returns a response to the user or the previous server.

Ｗｅｂサーバ、アプリケーションサーバ及びデータベースサーバの３層構成を有するコンピュータシステムを例に挙げて、後処理を説明する。このコンピュータシステムでは、Ｗｅｂサーバがユーザからのリクエスト（処理要求）を受け取り、受け取ったリクエストをアプリケーションサーバに送る。アプリケーションサーバは、要求された処理を実行する。このとき、アプリケーションサーバは、処理実行に必要なデータをデータベースサーバに要求する。データベースサーバは、要求されたデータを求めて登録データベースを検索し、読み出したデータをアプリケーションサーバに提供する。アプリケーションサーバは、当該処理の結果をＷｅｂサーバに送る。ユーザはＷｅｂブラウザを利用し、Ｗｅｂサーバからその処理結果を取得する。これらＷｅｂサーバ、アプリケーションサーバ及びデータベースサーバのそれぞれにおいて、後処理が発生することがある。 The post-processing will be described by taking a computer system having a three-layer configuration of a Web server, an application server, and a database server as an example. In this computer system, a Web server receives a request (processing request) from a user and sends the received request to an application server. The application server executes the requested process. At this time, the application server requests data necessary for processing execution from the database server. The database server searches the registration database for the requested data and provides the read data to the application server. The application server sends the processing result to the Web server. A user acquires the processing result from a Web server using a Web browser. Post processing may occur in each of these Web server, application server, and database server.

Ｗｅｂサーバでは、ユーザにレスポンスを返した後に実行することが許容される処理がある。例えば、ユーザのアクセス終了時間の記録などである。このように、ユーザへのレスポンス送信後に行なわれる処理が、Ｗｅｂサーバにおける「後処理」である。 In the Web server, there is a process that is allowed to be executed after a response is returned to the user. For example, recording of user access end time. As described above, the processing performed after the response is transmitted to the user is “post-processing” in the Web server.

アプリケーションサーバは、Ｗｅｂサーバから受け取ったリクエストに応答するために必要な処理を行い、Ｗｅｂサーバにレスポンスを返す。Ｗｅｂサーバへのレスポンス送信後に実行が許容される処理が、アプリケーションサーバにおける「後処理」である。アプリケーションサーバで行われる後処理の例は、皆本房幸著、「ＪＢｏｓｓ入門」、技術評論社、ｐ．２０２に記載されている。この文献に記載されているように、アプリケーションサーバの実現の一形態であるＥＪＢ（ＥｎｔｅｒｐｒｉｓｅＪａｖａ（登録商標）Ｂｅａｎ）では、メッセージ駆動型Ｂｅａｎという機能を用いて「後処理」が行われる。 The application server performs processing necessary to respond to the request received from the Web server, and returns a response to the Web server. Processing that is allowed to be executed after the response is transmitted to the Web server is “post-processing” in the application server. Examples of post-processing performed by the application server are written by Fumiyuki Minamoto, “Introduction to JBoss”, Technical Review, p. 202. As described in this document, in “EJB (Enterprise Java (registered trademark) Bean)” which is one form of implementation of an application server, “post-processing” is performed using a function called message-driven bean.

データベースサーバは、アプリケーションサーバから受け取ったリクエストに応答するために必要な処理を行い、アプリケーションサーバにレスポンスを返す。アプリケーションサーバへのレスポンス送信後に実行が許容される処理が、データベースサーバにおける「後処理」である。データベースサーバで行われる後処理の例は、次の論文に記載されている：A. Tanaka et al., “Capacity Planning for Client Server Database Systems: A Case Study”, Proceedings of the 1995 International Workshop on Computer Performance Measurement and Analysis, pp. 110-117。この論文は、データベースサーバＯｒａｃｌｅ（米国オラクル社の登録商標）の内部動作が、ユーザにレスポンスを返すＳｈａｄｏｗとＬＧＷプロセスと、それらとは非同期に実行されるＤＢｗｒｉｔｅｒｓとから構成されていることを開示している。 The database server performs processing necessary for responding to the request received from the application server, and returns a response to the application server. Processing that is allowed to be executed after the response is sent to the application server is “post-processing” in the database server. An example of post-processing performed in a database server is described in the following paper: A. Tanaka et al., “Capacity Planning for Client Server Database Systems: A Case Study”, Proceedings of the 1995 International Workshop on Computer Performance. Measurement and Analysis, pp. 110-117. This paper discloses that the internal operation of the database server Oracle (registered trademark of Oracle Corporation in the United States) is composed of Shadow and LGW processes that return responses to users and DBwriters that are executed asynchronously. ing.

このような後処理が発生する場合、サーバ負荷の判定において次のような問題が生じる。例えば、あるサーバがリクエスト処理後にユーザあるいは前段階のサーバにレスポンスを返し、処理中リクエスト数が減少する場合を考える。その後に発生する後処理は、リクエストを処理したプロセスとは異なるプロセスが担当する。当該サーバがレスポンスを返した時点で、処理中リクエスト数が減少するため、サーバ負荷が減少していると判断される。しかしながら、実際には、後処理に伴う負荷に関しては、その後処理を担当するプロセス内で該当処理が終了するまで減少することにはならない。上述の関連技術では、この後処理に伴う負荷が考慮されていない。従って、サーバ負荷が適切に判断されているとは言えない。その結果、計算機システムを管理する上で、サーバ負荷を均等にすることができないという問題も生じる。 When such post-processing occurs, the following problem occurs in determining the server load. For example, consider a case where a server returns a response to a user or a previous server after processing a request, and the number of requests being processed decreases. The post-processing that occurs thereafter is handled by a process different from the process that processed the request. When the server returns a response, the number of requests being processed decreases, so it is determined that the server load has decreased. However, in practice, the load associated with post-processing does not decrease until the corresponding processing is completed in the process in charge of the subsequent processing. In the related art described above, the load associated with this post-processing is not taken into consideration. Therefore, it cannot be said that the server load is appropriately determined. As a result, there arises a problem that the server load cannot be made equal in managing the computer system.

本発明の１つの目的は、後処理が発生する場合においてサーバ負荷を適切に判定することができる技術を提供することにある。 One object of the present invention is to provide a technique capable of appropriately determining a server load when post-processing occurs.

本発明の第１の観点において、複数のサーバの負荷を管理するサーバ負荷管理システムが提供される。そのサーバ負荷管理システムは、複数のサーバの各々の性能指標の時系列変化を示す性能指標変化テーブルが格納される記憶装置と、性能指標変化テーブルを参照してサーバ負荷を算出する処理装置と、を備える。性能指標は、ＣＰＵ使用率と、処理中のリクエスト数の指標を与えるリクエストインデックスとを含む。処理装置は、傾向判定部と負荷算出部とを有する。傾向判定部は、リクエストインデックスの時系列変化に基づいて、複数のサーバの各々に関して処理中のリクエスト数が増加傾向か減少傾向かの判定を行う。負荷算出部は、ＣＰＵ使用率を判定の結果に基づいて修正し、修正後ＣＰＵ使用率をサーバ負荷として算出する。具体的には、判定結果が増加傾向の場合、負荷算出部はＣＰＵ使用率を上方修正する。一方、判定結果が減少傾向の場合、負荷算出部はＣＰＵ使用率を下方修正する。 In a first aspect of the present invention, a server load management system for managing loads of a plurality of servers is provided. The server load management system includes a storage device that stores a performance index change table that indicates a time-series change of each performance index of a plurality of servers, a processing device that calculates a server load with reference to the performance index change table, Is provided. The performance index includes a CPU usage rate and a request index that provides an index of the number of requests being processed. The processing apparatus includes a tendency determination unit and a load calculation unit. The trend determination unit determines whether the number of requests being processed is increasing or decreasing with respect to each of the plurality of servers based on the time series change of the request index. The load calculation unit corrects the CPU usage rate based on the determination result, and calculates the corrected CPU usage rate as the server load. Specifically, when the determination result is increasing, the load calculation unit upwardly corrects the CPU usage rate. On the other hand, when the determination result is decreasing, the load calculation unit corrects the CPU usage rate downward.

本発明の第２の観点において、複数のサーバの負荷を管理するサーバ負荷管理方法が提供される。そのサーバ負荷管理方法は、（Ａ）複数のサーバの各々の性能指標の時系列変化を示す性能指標変化テーブルを、記憶装置から読み出すステップと、ここで、性能指標は、ＣＰＵ使用率と、処理中のリクエスト数の指標を与えるリクエストインデックスとを含み、（Ｂ）リクエストインデックスの時系列変化に基づいて、複数のサーバの各々に関して処理中のリクエスト数が増加傾向か減少傾向かの判定を行うステップと、（Ｃ）ＣＰＵ使用率を上記判定の結果に基づいて修正し、修正後ＣＰＵ使用率をサーバ負荷として算出するステップと、を含む。判定結果が増加傾向の場合、ＣＰＵ使用率は上方修正される。一方、判定結果が減少傾向の場合、ＣＰＵ使用率は下方修正される。 In a second aspect of the present invention, a server load management method for managing loads of a plurality of servers is provided. The server load management method includes (A) a step of reading a performance index change table indicating a time series change of each performance index of a plurality of servers from a storage device, wherein the performance index includes a CPU usage rate, a process And (B) determining whether the number of requests being processed is increasing or decreasing with respect to each of the plurality of servers, based on a time-series change of the request index. And (C) correcting the CPU usage rate based on the result of the determination and calculating the corrected CPU usage rate as a server load. If the determination result is increasing, the CPU usage rate is corrected upward. On the other hand, when the determination result is decreasing, the CPU usage rate is corrected downward.

本発明の第３の観点において、複数のサーバの負荷の管理処理をコンピュータに実行させるサーバ負荷管理プログラムが提供される。その管理処理は、（ａ）複数のサーバの各々の性能指標の時系列変化を示す性能指標変化テーブルを、記憶装置から読み出すステップと、ここで、性能指標は、ＣＰＵ使用率と、処理中のリクエスト数の指標を与えるリクエストインデックスとを含み、（ｂ）リクエストインデックスの時系列変化に基づいて、複数のサーバの各々に関して処理中のリクエスト数が増加傾向か減少傾向かの判定を行うステップと、（ｃ）ＣＰＵ使用率を上記判定の結果に基づいて修正し、修正後ＣＰＵ使用率をサーバ負荷として算出するステップと、を含む。判定結果が増加傾向の場合、ＣＰＵ使用率は上方修正される。一方、判定結果が減少傾向の場合、ＣＰＵ使用率は下方修正される。 In a third aspect of the present invention, a server load management program for causing a computer to execute a load management process for a plurality of servers is provided. The management process includes: (a) a step of reading a performance index change table indicating a time series change of each of the performance indexes of the plurality of servers from the storage device, wherein the performance index includes the CPU usage rate, And (b) determining whether the number of requests being processed is increasing or decreasing with respect to each of the plurality of servers, based on a time-series change of the request index, (C) correcting the CPU usage rate based on the determination result, and calculating the corrected CPU usage rate as a server load. If the determination result is increasing, the CPU usage rate is corrected upward. On the other hand, when the determination result is decreasing, the CPU usage rate is corrected downward.

本発明によれば、計算機システムにおいて後処理が発生する場合でも、サーバ負荷を適切に判定することが可能となる。その結果、計算機システムにおいて適切な負荷分散が可能となる。 According to the present invention, it is possible to appropriately determine the server load even when post-processing occurs in the computer system. As a result, appropriate load distribution can be achieved in the computer system.

添付図面を参照して、本発明の実施の形態に係るサーバ負荷管理システム、方法及びプログラムを説明する。 A server load management system, method, and program according to embodiments of the present invention will be described with reference to the accompanying drawings.

１．概要
本発明の実施の形態では、複数のサーバから構成される計算機システムに関して各サーバの負荷を管理するための技術が提供される。この計算機システムにおいて、各サーバは、ユーザあるいは前段のサーバからリクエスト（処理要求）を受け取り、要求された処理を実行する。リクエスト処理後、各サーバは、ユーザあるいは前段階のサーバにレスポンスを返す。このレスポンスの返信後に実行が許容される処理が「後処理」である。尚、後処理は、リクエストを処理したプロセスとは異なるプロセスによって実行される。1. Outline In the embodiment of the present invention, a technique for managing the load of each server with respect to a computer system composed of a plurality of servers is provided. In this computer system, each server receives a request (processing request) from a user or a previous-stage server, and executes the requested processing. After processing the request, each server returns a response to the user or the previous server. Processing that is allowed to be executed after the response is returned is “post-processing”. The post-processing is executed by a process different from the process that processed the request.

あるサーバがレスポンスを返した時点で、当該サーバにおける処理中リクエスト数は減少する。しかしながら、後処理が発生する場合、その後処理に伴う負荷に関しては、その後処理を担当するプロセス内で該当処理が終了するまで減少することにはならない。従って、当該サーバの負荷を適切に判定するためには、処理中リクエスト数だけでなく、後処理に伴う負荷をも考慮することが望ましい。本実施の形態の１つの目的は、後処理を考慮することによって、サーバ負荷をより正確に算出する（見積もる）ことである。 When a server returns a response, the number of requests being processed in the server decreases. However, when post-processing occurs, the load associated with the subsequent processing does not decrease until the corresponding processing ends in the process in charge of the subsequent processing. Therefore, in order to appropriately determine the load on the server, it is desirable to consider not only the number of requests being processed but also the load associated with post-processing. One object of the present embodiment is to more accurately calculate (estimate) the server load by considering post-processing.

サーバ負荷は、例えば「ＣＰＵ使用率」で表される。ＣＰＵ使用率とは、ＣＰＵが単位時間あたりに処理を行う割合である。例えば、ＣＰＵが単位時間のうち３０％の時間だけ処理を行い、残りの７０％の時間はアイドル状態であったとき、ＣＰＵ使用率は０．３（３０％）である。 The server load is represented by, for example, “CPU usage rate”. The CPU usage rate is a rate at which the CPU performs processing per unit time. For example, when the CPU performs processing for 30% of the unit time and the remaining 70% is idle, the CPU usage rate is 0.3 (30%).

図１Ａは、あるサーバに関するＣＰＵ使用率と処理中リクエスト数との関係を概念的に示している。縦軸がＣＰＵ使用率Ｕを表し、横軸が処理中リクエスト数Ｃを表している。図１Ａ中の破曲線は、後処理が発生しない場合のＣＰＵ使用率Ｕと処理中リクエスト数Ｃとの関係を示している。例えばＭ／Ｍ／１型待ち行列モデルに従うと、その関係はＵ＝１−１／（Ｃ＋１）で表される。一般には、処理中リクエスト数Ｃが増加するにつれて、ＣＰＵ使用率Ｕも増加する。逆に、処理中リクエスト数Ｃが減少するにつれて、ＣＰＵ使用率Ｕも減少する。 FIG. 1A conceptually shows the relationship between the CPU usage rate and the number of requests being processed for a certain server. The vertical axis represents the CPU usage rate U, and the horizontal axis represents the number C of requests being processed. The broken line in FIG. 1A shows the relationship between the CPU usage rate U and the number C of requests being processed when no post-processing occurs. For example, according to the M / M / 1 type queuing model, the relationship is represented by U = 1−1 / (C + 1). In general, as the number C of requests being processed increases, the CPU usage rate U also increases. Conversely, as the number C of requests being processed decreases, the CPU usage rate U also decreases.

図１Ａ中の矢印は、後処理が発生する場合のＣＰＵ使用率Ｕと処理中リクエスト数Ｃとの関係を概念的に示している。特に、右向きの矢印は、処理中リクエスト数Ｃが「増加傾向」にある場合を示し、左向きの矢印は、処理中リクエスト数Ｃが「減少傾向」にある場合を示している。図１Ａに示されるように、概して、減少傾向の場合のＣＰＵ使用率Ｕは、増加傾向の場合のＣＰＵ使用率Ｕよりも高い。これは、次のような理由によると考えられる。処理中リクエスト数Ｃが増加傾向にある場合、それまでに処理したリクエスト数は比較的少なく、後処理に伴う負荷も比較的小さい。従って、ＣＰＵ使用率Ｕが比較的低くなる。一方、処理中リクエスト数Ｃが減少傾向にある場合、それまでに処理したリクエスト数は比較的多く、後処理に伴う負荷が比較的大きい。従って、ＣＰＵ使用率Ｕが比較的高くなる。 The arrows in FIG. 1A conceptually show the relationship between the CPU usage rate U and the number C of requests being processed when post-processing occurs. In particular, a right-pointing arrow indicates a case where the number of requests being processed C is “increasing tendency”, and a left-pointing arrow indicates a case where the number of requests being processed C is “decreasing”. As shown in FIG. 1A, in general, the CPU usage rate U in the case of a decreasing trend is higher than the CPU usage rate U in the case of an increasing trend. This is considered to be due to the following reasons. When the number C of requests being processed tends to increase, the number of requests processed so far is relatively small, and the load associated with post-processing is also relatively small. Therefore, the CPU usage rate U is relatively low. On the other hand, when the number C of requests being processed tends to decrease, the number of requests processed so far is relatively large, and the load associated with post-processing is relatively large. Therefore, the CPU usage rate U is relatively high.

このように、増加傾向の場合と減少傾向の場合とでは、同じＣＰＵ使用率Ｕを与える処理中リクエスト数Ｃが異なっている。言い換えれば、処理中リクエスト数Ｃが同じであったとしても、増加傾向の場合と減少傾向の場合とではＣＰＵ使用率Ｕが異なる。すなわち、後処理が発生する場合は、処理中リクエスト数Ｃから単純にサーバ負荷（ＣＰＵ使用率）を算出することはできないのである。サーバ負荷を適切に算出するためには、処理中リクエスト数Ｃが増加傾向にあるか減少傾向にあるかを認識することが重要である。 As described above, the number C of requests being processed that gives the same CPU usage rate U is different between an increasing tendency and a decreasing tendency. In other words, even if the number C of requests being processed is the same, the CPU usage rate U differs between an increasing trend and a decreasing trend. That is, when post-processing occurs, the server load (CPU usage rate) cannot be simply calculated from the number C of requests being processed. In order to calculate the server load appropriately, it is important to recognize whether the number of requests C being processed is increasing or decreasing.

そこで、本実施の形態では、各サーバから、ＣＰＵ使用率Ｕと共に処理中リクエスト数Ｃに関する情報が継続的に収集される。そして、処理中リクエスト数Ｃの時系列変化に基づいて、処理中リクエスト数Ｃが増加傾向か減少傾向かの判定が行われる。更に、その判定結果に基づいて、ＣＰＵ使用率Ｕの修正（補正）が行われる。このときの修正方針として、本願発明者は次のものを考え出した。 Therefore, in this embodiment, information on the number C of requests being processed is continuously collected from each server together with the CPU usage rate U. Based on the time-series change in the number of requests C being processed, it is determined whether the number C of requests being processed is increasing or decreasing. Further, the CPU usage rate U is corrected (corrected) based on the determination result. As a correction policy at this time, the present inventor has devised the following.

まず、処理中リクエスト数Ｃが増加傾向にある場合を考える。この場合、上述の通り、それまでに処理したリクエスト数は比較的少なく、後処理に伴う負荷も現在のところは比較的小さい。しかしながら、処理中リクエスト数Ｃが増加傾向にあるということは、後処理に伴う負荷が将来的に増大することを意味する。言い換えれば、後処理の増大によって、将来的にはＣＰＵ使用率Ｕがより高くなる確率が極めて高い。従って、後処理に伴う負荷の増大を見込んで、ＣＰＵ使用率Ｕはより高い方に修正される。すなわち、ＣＰＵ使用率Ｕは「上方修正」される。 First, let us consider a case where the number C of requests being processed tends to increase. In this case, as described above, the number of requests processed so far is relatively small, and the load accompanying post-processing is also relatively small at present. However, the fact that the number C of requests being processed tends to increase means that the load accompanying post-processing will increase in the future. In other words, the probability that the CPU usage rate U will become higher in the future due to the increase in post-processing is extremely high. Accordingly, the CPU usage rate U is corrected to a higher value in anticipation of an increase in load accompanying post-processing. That is, the CPU usage rate U is “upwardly revised”.

次に、処理中リクエスト数Ｃが減少傾向にある場合を考える。この場合、上述の通り、それまでに処理したリクエスト数は比較的多く、後処理に伴う負荷は現在のところは比較的大きい。しかしながら、処理中リクエスト数Ｃが減少傾向にあるということは、後処理に伴う負荷が将来的に減少することを意味する。言い換えれば、後処理の減少によって、将来的にはＣＰＵ使用率Ｕがより低くなる確率が極めて高い。従って、後処理に伴う負荷の減少を見込んで、ＣＰＵ使用率Ｕはより低い方に修正される。すなわち、ＣＰＵ使用率Ｕは「下方修正」される。 Next, consider a case where the number C of requests being processed tends to decrease. In this case, as described above, the number of requests processed so far is relatively large, and the load associated with post-processing is currently relatively large. However, the fact that the number C of requests being processed tends to decrease means that the load accompanying post-processing will decrease in the future. In other words, there is a very high probability that the CPU usage rate U will become lower in the future due to the decrease in post-processing. Therefore, the CPU usage rate U is corrected to a lower value in anticipation of a reduction in load accompanying post-processing. That is, the CPU usage rate U is “corrected downward”.

このように、本実施の形態によれば、後処理に伴う負荷を考慮することによって、各サーバから通知されるＣＰＵ使用率Ｕの修正（補正）が実施される。修正により得られるＣＰＵ使用率は、以下「修正後ＣＰＵ使用率Ｕ’」と参照される。この修正後ＣＰＵ使用率Ｕ’が、修正前のＣＰＵ使用率Ｕよりも正確なサーバ負荷として扱われる。 As described above, according to this embodiment, the CPU usage rate U notified from each server is corrected (corrected) by taking into account the load associated with post-processing. The CPU usage rate obtained by the correction is hereinafter referred to as “corrected CPU usage rate U ′”. This corrected CPU usage rate U 'is treated as a more accurate server load than the CPU usage rate U before correction.

尚、処理中リクエスト数Ｃが増加傾向か減少傾向かを判断するにあたり、処理中リクエスト数Ｃそのもの以外のパラメータを用いることも可能である。 In determining whether the number of requests C being processed is increasing or decreasing, it is possible to use parameters other than the number of requests C being processed itself.

例えば、図１Ｂは、あるサーバに関するＣＰＵ使用率と単位時間あたりのリクエスト到着数との関係を概念的に示している。縦軸がＣＰＵ使用率Ｕを表し、横軸が単位時間あたりのリクエスト到着数λを表している。図１Ｂ中の破曲線は、後処理が発生しない場合の関係を示しており、例えばＵ＝λ／μで表される。μは、ＣＰＵの単位時間あたりの最大処理数である。また、図１Ｂ中の矢印は、後処理が発生する場合の関係を概念的に示している。右向きの矢印は、リクエスト到着数λが増加傾向にある場合を示し、左向きの矢印は、リクエスト到着数λが減少傾向にある場合を示している。この図１Ｂで示される特性も、図１Ａで示された特性と同様であることに留意されたい。従って、リクエスト到着数λが増加傾向か減少傾向かを判定することは、処理中リクエスト数Ｃが増加傾向か減少傾向かを判定することと同じ意味を持つ。すなわち、リクエスト到着数λの時系列変化に基づいて、処理中リクエスト数Ｃが増加傾向か減少傾向かの判定が可能である。 For example, FIG. 1B conceptually shows the relationship between the CPU usage rate for a certain server and the number of request arrivals per unit time. The vertical axis represents the CPU usage rate U, and the horizontal axis represents the number of request arrivals λ per unit time. The broken curve in FIG. 1B shows the relationship when no post-processing occurs, and is represented by, for example, U = λ / μ. μ is the maximum number of processes per unit time of the CPU. Moreover, the arrows in FIG. 1B conceptually show the relationship when post-processing occurs. A right-pointing arrow indicates a case where the request arrival number λ tends to increase, and a left-pointing arrow indicates a case where the request arrival number λ tends to decrease. It should be noted that the characteristic shown in FIG. 1B is similar to the characteristic shown in FIG. 1A. Therefore, determining whether the request arrival number λ is increasing or decreasing has the same meaning as determining whether the number of requests C being processed is increasing or decreasing. That is, it is possible to determine whether the number C of requests being processed is increasing or decreasing based on the time series change of the request arrival number λ.

その他、到着リクエストが格納されるキュー（待ち行列）の長さや、リクエストを処理するためのプロセス数を同様に用いることも可能である。更に、リクエスト到着数とリクエスト退去数（レスポンスが返されたリクエストの数）の組み合わせが用いられてもよい。ある時点までのリクエスト到着数の総和から、その時点までのリクエスト退去数の総和を引くことによって、処理中リクエスト数Ｃを見積もることができる。 In addition, the length of the queue (queue) in which the arrival request is stored and the number of processes for processing the request can be similarly used. Further, a combination of the number of request arrivals and the number of request evictions (the number of requests for which responses have been returned) may be used. The number C of requests being processed can be estimated by subtracting the total number of request evictions up to that point from the total number of request arrivals up to a certain point.

以上に説明されたパラメータは、以下「リクエストインデックスＲＩ」と参照される。リクエストインデックスＲＩは、処理中のリクエスト数Ｃの指標を与えるパラメータである。リクエストインデックスＲＩとしては、上述の処理中リクエスト数Ｃそのもの、サーバへの単位時間あたりのリクエスト到着数λ、到着リクエストが格納されるキュー（待ち行列）の長さ、リクエストを処理するためのプロセス数、リクエスト到着数λとリクエスト退去数の組み合わせなどが挙げられる。このようなリクエストインデックスＲＩの時系列変化に基づいて、処理中リクエストＣが増加傾向か減少傾向かの判定を行うことが可能である。 The parameter described above is hereinafter referred to as “request index RI”. The request index RI is a parameter that gives an index of the number C of requests being processed. The request index RI includes the number of requests C being processed, the number of request arrivals λ per unit time to the server, the length of the queue (queue) in which the arrival requests are stored, and the number of processes for processing the requests. And a combination of the request arrival number λ and the request withdrawal number. Based on such a time-series change of the request index RI, it is possible to determine whether the request C being processed is increasing or decreasing.

以上に説明された処理を実現するためのサーバ負荷管理システム及びその動作を、以下詳細に説明する。 A server load management system and its operation for realizing the processing described above will be described in detail below.

２．構成
図２は、本実施の形態に係るサーバ負荷管理システム１の構成例を示すブロック図である。サーバ負荷管理システム１は、ネットワークを介して複数のサーバ１００−１〜１００−ｎと通信可能に接続されたコンピュータであり、複数のサーバ１００−１〜１００−ｎのそれぞれの負荷を管理する。サーバ１００の台数はｎ台である（ｎは１以上の整数）。2. Configuration FIG. 2 is a block diagram showing a configuration example of the serverload management system 1 according to the present embodiment. The serverload management system 1 is a computer that is communicably connected to a plurality of servers 100-1 to 100-n via a network, and manages each load of the plurality of servers 100-1 to 100-n. The number ofservers 100 is n (n is an integer of 1 or more).

例えば、各サーバ１００は、Ｗｅｂサーバ、アプリケーションサーバ、あるいはデータベースサーバである。Ｗｅｂサーバの場合、各サーバ１００は、ユーザからのリクエストを、ユーザから直接あるいは負荷分散装置などを経由して受け取り、必要な処理を実行し、処理結果（レスポンス）をユーザに返す。アプリケーションサーバの場合、各サーバ１００は、前段のＷｅｂサーバからリクエストを受け取り、必要な処理を実行し、処理結果（レスポンス）をＷｅｂサーバに返す。データベースサーバの場合、各サーバ１００は、前段のアプリケーションサーバからリクエスト（データベース処理要求）を受け取り、必要な処理を実行し、処理結果（レスポンス）をアプリケーションサーバに返す。いずれの場合でも、各サーバ１００は、何らかのリクエストを受け取り、そのリクエストに対するレスポンスを返す。そして、レスポンスの返信後に「後処理」が発生する。 For example, eachserver 100 is a Web server, an application server, or a database server. In the case of a Web server, eachserver 100 receives a request from a user directly or via a load balancer, executes necessary processing, and returns a processing result (response) to the user. In the case of an application server, eachserver 100 receives a request from the preceding Web server, executes necessary processing, and returns a processing result (response) to the Web server. In the case of a database server, eachserver 100 receives a request (database processing request) from the preceding application server, executes necessary processing, and returns a processing result (response) to the application server. In any case, eachserver 100 receives some request and returns a response to the request. Then, “post-processing” occurs after the response is returned.

各サーバ１００は、自身の「性能指標」を計測するための監視部１１０を有している。性能指標は、各サーバ１００におけるＣＰＵ使用率Ｕと上述のリクエストインデックスＲＩを含んでいる。例えば、性能指標は、ＣＰＵ使用率Ｕと処理中リクエスト数Ｃである。このような性能指標は、ＵＮＩＸ（登録商標）（もしくはＬｉｎｕｘ）オペレーティングシステムで用意されている“ｓｙｓｓｔａｔ”、“ｖｍｓｔａｔ”、“ｔｏｐ”などのコマンドを用いることにより計測することができる。あるいは、Ｗｉｎｄｏｗｓ（登録商標）オペレーティングシステムの場合、“パフォーマンスモニタ”を用いることにより性能指標を計測することができる。各サーバ１００は、監視部１１０を用いて性能指標を計測し、得られた性能指標と計測時刻を示す性能指標データＰＩをサーバ負荷管理システム１に逐次送信する。 Eachserver 100 has amonitoring unit 110 for measuring its own “performance index”. The performance index includes the CPU usage rate U in eachserver 100 and the above-described request index RI. For example, the performance index is the CPU usage rate U and the number of requests C being processed. Such a performance index can be measured by using commands such as “sysstat”, “vmstat”, and “top” prepared in the UNIX (registered trademark) (or Linux) operating system. Alternatively, in the case of a Windows (registered trademark) operating system, a performance index can be measured by using a “performance monitor”. Eachserver 100 measures a performance index using themonitoring unit 110 and sequentially transmits performance index data PI indicating the obtained performance index and measurement time to the serverload management system 1.

図２に示されるように、本実施の形態に係るサーバ負荷管理システム１は、処理装置２、記憶装置３、通信装置４、入力装置５及び出力装置６を備えている。処理装置２はＣＰＵを含んでおり、各種データ処理を行う。記憶装置３として、ＨＤＤ（Hard Disk Drive）やＲＡＭ（Random Access Memory）等が例示される。通信装置４は、ネットワークに接続されるネットワークインタフェースである。入力装置５として、キーボード、マウス、メディアドライブ等が例示される。出力装置６として、ディスプレイ等が例示される。 As shown in FIG. 2, the serverload management system 1 according to the present embodiment includes aprocessing device 2, astorage device 3, acommunication device 4, aninput device 5, and an output device 6. Theprocessing device 2 includes a CPU and performs various data processing. Examples of thestorage device 3 include a hard disk drive (HDD) and a random access memory (RAM). Thecommunication device 4 is a network interface connected to the network. Examples of theinput device 5 include a keyboard, a mouse, and a media drive. Examples of the output device 6 include a display.

記憶装置３には、性能指標変化テーブルＴＢＬ、サーバグループ管理リストＧＲＰなどが格納される。性能指標変化テーブルＴＢＬは、各サーバ１００の性能指標の時系列変化を示すテーブルであり、その詳細は後述される。サーバグループ管理リストＧＲＰは、各サーバをグループ分けして示すリストであり、その詳細は後述される。 Thestorage device 3 stores a performance index change table TBL, a server group management list GRP, and the like. The performance index change table TBL is a table showing the time series change of the performance index of eachserver 100, and details thereof will be described later. The server group management list GRP is a list showing each server divided into groups, and details thereof will be described later.

処理装置２は、記憶装置３に格納された性能指標変化テーブルＴＢＬを参照して、サーバ負荷を算出する。より詳細には、処理装置２は、サーバ負荷判定プログラムＰＲＯＧ１を実行することにより、サーバ負荷算出処理を行う。サーバ負荷判定プログラムＰＲＯＧ１は、処理装置２によって実行されるソフトウェアプログラムである。典型的には、サーバ負荷判定プログラムＰＲＯＧ１は、コンピュータ読み取り可能な記録媒体に記録されている。処理装置２は、その記録媒体からサーバ負荷判定プログラムＰＲＯＧ１を読み出し、実行する。 Theprocessing device 2 refers to the performance index change table TBL stored in thestorage device 3 and calculates the server load. More specifically, theprocessing device 2 performs a server load calculation process by executing a server load determination program PROG1. The server load determination program PROG1 is a software program executed by theprocessing device 2. Typically, the server load determination program PROG1 is recorded on a computer-readable recording medium. Theprocessing device 2 reads the server load determination program PROG1 from the recording medium and executes it.

図３は、本実施の形態に係る処理装置２の機能を示すブロック図である。図３に示されるように、処理装置２は、性能指標蓄積部１０、傾向判定部２０、サーバ負荷算出部３０、及びサーバ選択部４０を有している。これら性能指標蓄積部１０、傾向判定部２０、サーバ負荷算出部３０、及びサーバ選択部４０は、処理装置２とサーバ負荷判定プログラムＰＲＯＧ１との協働によって実現され、それぞれ所定の機能を提供する。以下、本実施の形態に係るサーバ負荷管理システム１（処理装置２）による処理を詳細に説明する。 FIG. 3 is a block diagram illustrating functions of theprocessing apparatus 2 according to the present embodiment. As illustrated in FIG. 3, theprocessing device 2 includes a performanceindex accumulation unit 10, atendency determination unit 20, a serverload calculation unit 30, and aserver selection unit 40. The performanceindex accumulation unit 10, thetrend determination unit 20, the serverload calculation unit 30, and theserver selection unit 40 are realized by the cooperation of theprocessing device 2 and the server load determination program PROG1, and each provide a predetermined function. Hereinafter, processing by the server load management system 1 (processing device 2) according to the present embodiment will be described in detail.

３．処理フロー
図４は、本実施の形態に係るサーバ負荷管理システム１による処理を示すフローチャートである。図２〜図４を適宜参照して、本実施の形態に係る処理フローを詳しく説明する。3. Processing Flow FIG. 4 is a flowchart showing processing by the serverload management system 1 according to the present embodiment. The processing flow according to this embodiment will be described in detail with reference to FIGS.

ステップＳ１０：
各サーバ１００の監視部１１０は、自身の性能指標を所定の時間間隔で測定する。性能指標は、ＣＰＵ使用率Ｕと上述のリクエストインデックスＲＩを含んでいる。そして、各サーバ１００は、得られた性能指標とその測定時刻を示す性能指標データＰＩを、サーバ負荷管理システム１に逐次送信する。サーバ負荷管理システム１の通信装置４は、各サーバ１００から送られてくる性能指標データＰＩを受け取る。性能指標蓄積部１０は、受け取った性能指標データＰＩを、記憶装置３中の性能指標変化テーブルＴＢＬに登録する。Step S10:
Themonitoring unit 110 of eachserver 100 measures its own performance index at predetermined time intervals. The performance index includes the CPU usage rate U and the above-described request index RI. Eachserver 100 sequentially transmits the obtained performance index and performance index data PI indicating the measurement time to the serverload management system 1. Thecommunication device 4 of the serverload management system 1 receives performance index data PI sent from eachserver 100. The performanceindex accumulation unit 10 registers the received performance index data PI in the performance index change table TBL in thestorage device 3.

図５Ａは、本実施の形態に係る性能指標変化テーブルＴＢＬを概念的に示している。図５Ａに示されるように、性能指標変化テーブルＴＢＬは、測定時刻（あるいはサーバ負荷管理システム１の受信時刻）と性能指標（ＣＰＵ使用率Ｕ、リクエストインデックスＲＩ）を示している。つまり、性能指標変化テーブルＴＢＬは、性能指標の時系列変化を示している。このような性能指標変化テーブルＴＢＬは、サーバ１００毎に作成される。 FIG. 5A conceptually shows the performance index change table TBL according to the present embodiment. As shown in FIG. 5A, the performance index change table TBL shows the measurement time (or the reception time of the server load management system 1) and the performance index (CPU usage rate U, request index RI). That is, the performance index change table TBL indicates the time series change of the performance index. Such a performance index change table TBL is created for eachserver 100.

図５Ｂは、リクエストインデックスＲＩが処理中リクエスト数Ｃである場合の、あるサーバ１００に関する性能指標変化テーブルＴＢＬの例を示している。図５Ｂに示されるように、性能指標変化テーブルＴＢＬは、当該サーバ１００のＣＰＵ使用率Ｕ及び処理中リクエスト数Ｃの時系列変化を示している。例えば、時刻Ｔ_１において、ＣＰＵ使用率はＵ_１であり処理中リクエスト数はＣ_１であり、直近の時刻Ｔｓのおいて、ＣＰＵ使用率はＵｓであり処理中リクエスト数はＣｓである。新たに受け取る性能指標データＰＩは、テーブルの次の行に追記される。FIG. 5B shows an example of the performance index change table TBL related to acertain server 100 when the request index RI is the number C of requests being processed. As shown in FIG. 5B, the performance index change table TBL shows time-series changes in the CPU usage rate U and the number C of requests being processed of theserver 100. For example, at time_{T 1,} CPU utilization_{U 1} a is treated in the number of requests is_{C 1,} Oite of the most recent time Ts, CPU utilization is Us Processing number of requests is Cs. The newly received performance index data PI is added to the next line of the table.

ステップＳ１５：
サーバ負荷の算出処理は、所定のタイミング毎に実施される。そのタイミングまでは、ステップＳ１０が繰り返され、性能指標変化テーブルＴＢＬにデータが蓄積される。所定のタイミングになると、次のステップＳ２０が実行される。Step S15:
The server load calculation process is performed every predetermined timing. Until that timing, step S10 is repeated and data is accumulated in the performance index change table TBL. When the predetermined timing is reached, the next step S20 is executed.

ステップＳ２０：
傾向判定部２０は、各サーバ１００に関して、処理中リクエスト数Ｃが増加傾向か減少傾向かの判定を行う。この判定にあたり、傾向判定部２０は、記憶装置３に格納された性能指標変化テーブルＴＢＬで示されるリクエストインデックスＲＩの時系列変化を参照することができる。つまり、傾向判定部２０は、リクエストインデックスＲＩの時系列変化に基づいて、各サーバ１００の処理中リクエスト数Ｃの変化傾向を判定する。そして、傾向判定部２０は、その判定結果に基づいて、サーバ１００−１〜１００−ｎのそれぞれをグループ分けする。上記判定結果、すなわち、処理中リクエスト数Ｃの変化傾向に応じて、次の３種類のサーバグループが考えられる。Step S20:
Thetrend determination unit 20 determines whether the number of requests being processed C is increasing or decreasing for eachserver 100. In this determination, thetrend determination unit 20 can refer to the time series change of the request index RI indicated by the performance index change table TBL stored in thestorage device 3. That is, thetendency determination unit 20 determines the change tendency of the number of requests C being processed by eachserver 100 based on the time series change of the request index RI. Then, thetendency determination unit 20 groups each of the servers 100-1 to 100-n based on the determination result. The following three types of server groups are conceivable depending on the determination result, that is, the changing tendency of the number of requests C being processed.

（１）サーバグループＺ：処理中リクエスト数Ｃが増加傾向にある
（２）サーバグループＧ：処理中リクエスト数Ｃが減少傾向にある
（３）サーバグループＥ：それ以外(1) Server group Z: The number C of requests being processed is increasing (2) Server group G: The number C of requests being processed is decreasing (3) Server group E: Other

記憶装置３中のサーバグループ管理リストＧＲＰは、このようなサーバのグループ分けを示す。つまり、サーバグループ管理リストＧＲＰは、サーバグループ（Ｚ，Ｇ，Ｅ）のそれぞれに属しているサーバのリストを示す。尚、サーバグループＥは、必ずしも設定されなくてもよく、必要に応じて設定されればよい。 The server group management list GRP in thestorage device 3 shows such server grouping. That is, the server group management list GRP shows a list of servers belonging to each of the server groups (Z, G, E). Note that the server group E does not necessarily have to be set, and may be set as necessary.

図６は、本ステップＳ２０における処理フローの一例を示している。まず、初期化が行われる（ステップＳ２１）。具体的には、サーバグループ管理リストＧＲＰが初期化され、全てのサーバグループが空に設定される。また、判定対象のサーバ番号ｉが１に初期化される。サーバ番号ｉが取り得る値は、１からｎ（サーバ総数）までである。その後、判定対象のサーバ１００−ｉに関して、次の処理が実施される。 FIG. 6 shows an example of the processing flow in this step S20. First, initialization is performed (step S21). Specifically, the server group management list GRP is initialized and all server groups are set to be empty. In addition, the server number i to be determined is initialized to 1. Possible values of the server number i are 1 to n (total number of servers). Thereafter, the following processing is performed on the determination target server 100-i.

ステップＳ２２：
まず、傾向判定部２０は、記憶装置３に格納された性能指標変化テーブルＴＢＬから、サーバ１００−ｉに関する性能指標を読み出す。ここでは、性能指標は、ＣＰＵ使用率Ｕと処理中リクエスト数Ｃであるとする。サーバ１００−ｉに関して、次の（ｋ＋１）種類の時刻のそれぞれにおける性能指標が読み出される。ｋは１以上の整数である。Step S22:
First, thetrend determination unit 20 reads a performance index related to the server 100-i from the performance index change table TBL stored in thestorage device 3. Here, it is assumed that the performance indexes are the CPU usage rate U and the number of requests C being processed. Regarding the server 100-i, the performance index at each of the next (k + 1) types of times is read out. k is an integer of 1 or more.

時刻Ｔ_ｐ：ＣＰＵ使用率Ｕ_ｐ，処理中リクエスト数Ｃ_ｐ
時刻Ｔ_ｐ−１：ＣＰＵ使用率Ｕ_ｐ−１，処理中リクエスト数Ｃ_ｐ−１
・・・
時刻Ｔ_{ｐ−（ｋ−１）}：ＣＰＵ使用率Ｕ_{ｐ−（ｋ−１）}，処理中リクエスト数Ｃ_{ｐ−（ｋ−１）}
時刻Ｔ_ｐ−ｋ：ＣＰＵ使用率Ｕ_ｐ−ｋ，処理中リクエスト数Ｃ_ｐ−ｋTime T_p : CPU usage rate U_p , number of requests being processed C_p
Time T_p-1 : CPU usage rate U_p-1 , number of requests being processed C_p-1
...
Time T_{p− (k−1)} : CPU usage rate U_{p− (k−1)} , number of requests being processed C_{p− (k−1)}
Time T_pk : CPU usage rate U_pk , number of requests being processed C_pk

時刻Ｔ_ｐは現在時刻に最も近い時刻であり、パラメータｋが大きくなるにつれて時刻は過去に遡る。パラメータｋは、ユーザによってあらかじめ指定されている。典型的にはパラメータｋは１に設定され（ｋ＝１）、その場合、直近の時刻Ｔ_ｐとその前の時刻Ｔ_ｐ−１のそれぞれにおける性能指標が読み出される。The time T_p is the time closest to the current time, and the time goes back in the past as the parameter k increases. The parameter k is designated in advance by the user. Typically, the parameter k is set to 1 (k = 1), and in this case, the performance index at each of the latest time T_p and the previous time T_p−1 is read.

ステップＳ２３：
次に、傾向判定部２０は、読み出された処理中リクエスト数Ｃ_ｐ〜Ｃ_ｐ−ｋ（リクエストインデックスＲＩ）の時系列変化に基づいて、サーバ１００−ｉの処理中リクエスト数Ｃの変化率Ｖを算出する。そのために、傾向判定部２０は、時刻Ｔ_ｐ−ｊ〜Ｔ_{ｐ−（ｊ−１）}間での処理中リクエスト数Ｃの単位変化率Ｖ_ｊをそれぞれ算出する（ｊは、１〜ｋの整数）。単位変化率Ｖ_ｊは、次の式（１）で表される。Step S23:
Next, tendsdetermination unit 20, when based on the sequence change, the processing in the rate of change of the number of requests C servers 100-i of the number of processing read requests_{_C p}_~C_p-k (request index RI) V is calculated. Therefore, thetendency determination unit 20 calculates the unit change rate V_j of the number C of requests being processed between times T_{p−j to} T_{p− (j−1)} (j is an integer of 1 to k). ). The unit change rate V_j is expressed by the following equation (1).

更に、傾向判定部２０は、算出されたそれぞれの単位変化率Ｖ_ｊの平均値を変化率Ｖとして算出する。処理中リクエスト数Ｃの変化率Ｖは、次の式（２）で表される。Furthermore, thetendency determination unit 20 calculates the average value of the calculated unit change rates V_j as the change rate V. The change rate V of the number C of requests being processed is expressed by the following equation (2).

変化率Ｖは、単位変化率Ｖ_ｊの重み付け平均であってもよい。この場合、予め指定された重み付けパラメータβ_ｊを用いることにより、変化率Ｖは次の式（３）で表される。Change rate V may be weighted average of the unit change ratio V_j. In this case, the change rate V is expressed by the following equation (3) by using the weighting parameter β_j specified in advance.

パラメータｋが１の場合、あるいは、時間間隔Ｔ_{ｐ−（ｊ−１）}−Ｔ_ｐ−ｊが一定の場合には、単位変化率Ｖ_ｊは次の式（４）で定義されてもよい。When the parameter k is 1, or when the time interval T_{p− (j−1)} −T_p−j is constant, the unit change rate V_j may be defined by the following equation (4).

ステップＳ２４：
次に、傾向判定部２０は、ステップＳ２３で算出された処理中リクエスト数Ｃの変化率Ｖに基づいて、サーバ１００−ｉのグループ分けを行う。例えば、傾向判定部２０は、変化率Ｖを所定の閾値と比較する。変化率Ｖが所定の正の閾値ε以上である場合、傾向判定部２０は、処理中リクエスト数Ｃが増加傾向にあると判定する。従って、傾向判定部２０は、サーバ１００−ｉを、サーバグループ管理リストＧＲＰ中のサーバグループＺに登録する。一方、変化率Ｖが所定の負の閾値−ε以下である場合、傾向判定部２０は、処理中リクエスト数Ｃが減少傾向にあると判定する。従って、傾向判定部２０は、サーバ１００−ｉを、サーバグループ管理リストＧＲＰ中のサーバグループＧに登録する。それ以外の場合、傾向判定部２０は、サーバ１００−ｉを、サーバグループ管理リストＧＲＰ中のサーバグループＥに登録する。Step S24:
Next, thetendency determination unit 20 performs grouping of the servers 100-i based on the change rate V of the number of requests C being processed calculated in step S23. For example, thetendency determination unit 20 compares the change rate V with a predetermined threshold value. When the change rate V is equal to or greater than the predetermined positive threshold ε, thetendency determination unit 20 determines that the number C of requests being processed is increasing. Therefore, thetendency determination unit 20 registers the server 100-i in the server group Z in the server group management list GRP. On the other hand, when the change rate V is equal to or less than the predetermined negative threshold −ε, thetendency determination unit 20 determines that the number C of requests being processed is decreasing. Therefore, thetendency determination unit 20 registers the server 100-i in the server group G in the server group management list GRP. In other cases, thetendency determination unit 20 registers the server 100-i in the server group E in the server group management list GRP.

尚、上記閾値（ε，−ε）は、ユーザによってあらかじめ指定される微少量である。閾値（ε，−ε）は０であってもよく、その場合、サーバグループＥは設定されない。 The threshold value (ε, −ε) is a small amount specified in advance by the user. The threshold value (ε, −ε) may be 0. In this case, the server group E is not set.

ステップＳ２５，Ｓ２６：
現在のサーバ番号ｉがサーバ総数ｎと等しくない場合（ステップＳ２５；Ｎｏ）、サーバ番号ｉに１が加算され、次のサーバ１００−ｉが選択される（ステップＳ２６）。その後、ステップＳ２２〜Ｓ２４が同様に行われる。現在のサーバ番号ｉがサーバ総数ｎと等しい場合（ステップＳ２５；Ｙｅｓ）、ステップＳ２０は終了する。このようにして、全てのサーバ１００−１〜１００−ｎのグルーピングが完了し、サーバグループ管理リストＧＲＰが完成する。Steps S25 and S26:
When the current server number i is not equal to the total number n of servers (step S25; No), 1 is added to the server number i, and the next server 100-i is selected (step S26). Thereafter, steps S22 to S24 are similarly performed. If the current server number i is equal to the total number n of servers (step S25; Yes), step S20 ends. In this way, the grouping of all the servers 100-1 to 100-n is completed, and the server group management list GRP is completed.

図７Ａは、ステップＳ２０で作成されるサーバグループ管理リストＧＲＰの一例を示している。ここでは、サーバ総数ｎは５である。サーバグループＺ（増加傾向）は、サーバ１００−２とサーバ１００−３を含んでいる。サーバグループＧ（減少傾向）は、サーバ１００−１とサーバ１００−５を含んでいる。サーバグループＥ（それ以外）は、サーバ１００−４を含んでいる。 FIG. 7A shows an example of the server group management list GRP created in step S20. Here, the total number n of servers is 5. The server group Z (increasing tendency) includes a server 100-2 and a server 100-3. The server group G (decreasing trend) includes a server 100-1 and a server 100-5. The server group E (other than that) includes the server 100-4.

また、図７Ｂに示されるように、サーバグループ管理リストＧＲＰは、サーバ１００−１〜１００−５のそれぞれのＣＰＵ使用率Ｕ（１）〜Ｕ（５）を示していてもよい。ここで示されるＣＰＵ使用率Ｕ（１）〜Ｕ（５）の各々は、直近の時刻Ｔ_ｐでのＣＰＵ使用率Ｕ_ｐ、あるいは、所定期間内でのＣＰＵ使用率の平均値である。所定期間内でのＣＰＵ使用率の平均値は、例えば、本ステップＳ２０で性能指標変化テーブルＴＢＬから読み出されたＣＰＵ使用率Ｕ_ｐ〜Ｕ_ｐ−ｋの平均値である。Further, as illustrated in FIG. 7B, the server group management list GRP may indicate the CPU usage rates U (1) to U (5) of the servers 100-1 to 100-5. Each CPU utilization U (1) ~U (5) shown here, CPU utilization in the most recent time_{T p}_{U p} or an average value of the CPU utilization within a predetermined time period. The average value of the CPU usage rates within the predetermined period is, for example, the average value of the CPU usage rates U_{p to} U_p-k read from the performance index change table TBL in step S20.

ステップＳ３０：
サーバ負荷算出部３０は、サーバ負荷を算出（推定）する。より詳細には、サーバ負荷算出部３０は、上記ステップＳ２０での判定結果に応じてＣＰＵ使用率Ｕを修正（補正）し、修正後ＣＰＵ使用率Ｕ’を算出する。その修正後ＣＰＵ使用率Ｕ’が、サーバ負荷の推定値となる。Step S30:
The serverload calculation unit 30 calculates (estimates) the server load. More specifically, the serverload calculation unit 30 corrects (corrects) the CPU usage rate U according to the determination result in step S20, and calculates a corrected CPU usage rate U ′. The corrected CPU usage rate U ′ is an estimated value of the server load.

上記ステップＳ２０での判定結果、すなわち、各サーバ１００のグループ分けは、記憶装置３に格納されたサーバグループ管理リストＧＲＰを参照することにより得られる。修正対象のＣＰＵ使用率Ｕは、直近の時刻Ｔ_ｐでのＣＰＵ使用率Ｕ_ｐ、あるいは、所定期間内でのＣＰＵ使用率の平均値である。そのような修正対象のＣＰＵ使用率Ｕは、記憶装置２に格納されている性能指標変化テーブルＴＢＬを参照することにより得ることができる。あるいは、図７Ｂで示されたようなサーバグループ管理リストＧＲＰが作成されている場合、性能指標変化テーブルＴＢＬを参照することなく、修正対象のＣＰＵ使用率Ｕを得ることができる。The determination result in step S20, that is, the grouping of eachserver 100 is obtained by referring to the server group management list GRP stored in thestorage device 3. The CPU usage rate U to be corrected is the CPU usage rate U_p at the latest time T_p , or the average value of the CPU usage rates within a predetermined period. The CPU usage rate U to be corrected can be obtained by referring to the performance index change table TBL stored in thestorage device 2. Alternatively, when the server group management list GRP as shown in FIG. 7B has been created, the CPU usage rate U to be corrected can be obtained without referring to the performance index change table TBL.

まず、サーバグループＺに関するＣＰＵ使用率Ｕの修正を考える。サーバグループＺに含まれるサーバの処理中リクエスト数Ｃは増加傾向にある。処理中リクエスト数Ｃが増加傾向にあるということは、後処理に伴う負荷が将来的に増大することを意味する。言い換えれば、後処理の増大によって、将来的にはＣＰＵ使用率Ｕがより高くなる確率が極めて高い。従って、後処理に伴う負荷の増大を見込んで、サーバ負荷算出部３０は、ＣＰＵ使用率Ｕを「上方修正」する。 First, consider the correction of the CPU usage rate U for the server group Z. The number C of requests being processed by servers included in the server group Z tends to increase. The fact that the number C of requests being processed tends to increase means that the load accompanying post-processing will increase in the future. In other words, the probability that the CPU usage rate U will become higher in the future due to the increase in post-processing is extremely high. Therefore, the serverload calculation unit 30 “corrects upward” the CPU usage rate U in anticipation of an increase in load accompanying post-processing.

例えば、サーバ負荷算出部３０は、所定の比率だけＣＰＵ使用率Ｕを上方修正することができる。その所定の比率は、後処理が含まれない場合の負荷と後処理も含めた場合の負荷との間の比率を反映していることが望ましい。サーバがリクエストを受け取ってからレスポンスを返すまでのリクエスト処理に使われるＣＰＵ時間は“ｓ１”であるとする。また、サーバが当該レスポンスを返した後の後処理に使われるＣＰＵ時間は“ｓ２”であるとする。このとき、係数α＝ｓ１／（ｓ１＋ｓ２）は、後処理が含まれない場合の負荷と後処理も含めた場合の負荷との間の比率を反映していると言える。よって、サーバ負荷算出部３０は、ＣＰＵ使用率Ｕを係数α（＜１）で割ることによって、ＣＰＵ使用率Ｕを上方修正することができる。係数αは、ユーザによって予め指定されるとよい。サーバグループＺに含まれるあるサーバｉのＣＰＵ使用率はＵＺ（ｉ）であるとする。また、上方修正によって得られる修正後ＣＰＵ使用率はＵＺ’（ｉ）であるとする。このとき、修正後ＣＰＵ使用率ＵＺ’（ｉ）は、次の式（５）で与えられる。 For example, the serverload calculation unit 30 can upwardly correct the CPU usage rate U by a predetermined ratio. The predetermined ratio desirably reflects the ratio between the load when post-processing is not included and the load when post-processing is also included. It is assumed that the CPU time used for request processing from when the server receives a request until it returns a response is “s1”. Further, it is assumed that the CPU time used for post-processing after the server returns the response is “s2”. At this time, it can be said that the coefficient α = s1 / (s1 + s2) reflects the ratio between the load when the post-processing is not included and the load when the post-processing is also included. Therefore, the serverload calculation unit 30 can upwardly correct the CPU usage rate U by dividing the CPU usage rate U by the coefficient α (<1). The coefficient α may be designated in advance by the user. It is assumed that the CPU usage rate of a server i included in the server group Z is UZ (i). Further, it is assumed that the corrected CPU usage rate obtained by upward correction is UZ ′ (i). At this time, the corrected CPU usage rate UZ ′ (i) is given by the following equation (5).

次に、サーバグループＧに関するＣＰＵ使用率Ｕの修正を考える。サーバグループＧに含まれるサーバの処理中リクエスト数Ｃは減少傾向にある。処理中リクエスト数Ｃが減少傾向にあるということは、後処理に伴う負荷が将来的に減少することを意味する。言い換えれば、後処理の減少によって、将来的にはＣＰＵ使用率Ｕがより低くなる確率が極めて高い。従って、後処理に伴う負荷の減少を見込んで、サーバ負荷算出部３０は、ＣＰＵ使用率Ｕを「下方修正」する。 Next, consider the correction of the CPU usage rate U for the server group G. The number of requests C being processed by the servers included in the server group G tends to decrease. The fact that the number C of requests being processed tends to decrease means that the load accompanying post-processing will decrease in the future. In other words, there is a very high probability that the CPU usage rate U will become lower in the future due to the decrease in post-processing. Accordingly, the serverload calculation unit 30 “corrects downward” the CPU usage rate U in anticipation of a decrease in load accompanying post-processing.

例えば、サーバ負荷算出部３０は、所定の比率だけＣＰＵ使用率Ｕを下方修正することができる。その所定の比率は、後処理が含まれない場合の負荷と後処理も含めた場合の負荷との間の比率を反映していることが望ましい。例えば、上述の係数α＝ｓ１／（ｓ１＋ｓ２）が用いられる。サーバ負荷算出部３０は、ＣＰＵ使用率Ｕに係数α（＜１）を掛けることによって、ＣＰＵ使用率Ｕを下方修正することができる。サーバグループＧに含まれるあるサーバｉのＣＰＵ使用率はＵＧ（ｉ）であるとする。また、下方修正によって得られる修正後ＣＰＵ使用率はＵＧ’（ｉ）であるとする。このとき、修正後ＣＰＵ使用率ＵＧ’（ｉ）は、次の式（６）で与えられる。 For example, the serverload calculation unit 30 can correct the CPU usage rate U downward by a predetermined ratio. The predetermined ratio desirably reflects the ratio between the load when post-processing is not included and the load when post-processing is also included. For example, the above-described coefficient α = s1 / (s1 + s2) is used. The serverload calculation unit 30 can correct the CPU usage rate U downward by multiplying the CPU usage rate U by a coefficient α (<1). It is assumed that the CPU usage rate of a server i included in the server group G is UG (i). Further, it is assumed that the corrected CPU usage rate obtained by the downward correction is UG ′ (i). At this time, the corrected CPU usage rate UG ′ (i) is given by the following equation (6).

それ以外のサーバグループＥに関して、サーバ負荷算出部３０は特に修正を行わない。サーバグループＥに含まれるあるサーバｉのＣＰＵ使用率はＵＥ（ｉ）であるとする。便宜上、修正後ＣＰＵ使用率ＵＥ’（ｉ）を考えると、その修正後ＣＰＵ使用率ＵＥ’（ｉ）は、次の式（７）で与えられる。 For the other server groups E, the serverload calculation unit 30 does not particularly modify. It is assumed that the CPU usage rate of a server i included in the server group E is UE (i). For convenience, when the corrected CPU usage rate UE ′ (i) is considered, the corrected CPU usage rate UE ′ (i) is given by the following equation (7).

尚、サーバ負荷算出部３０は、サーバ１００−１〜１００−ｎの全てに関してＣＰＵ使用率Ｕの修正を行ってもよいし、そうでなくてもよい。サーバ負荷算出部３０は、サーバ１００−１〜１００−ｎからいくつかのサーバ１００を抽出し、抽出された各サーバ１００に関してＣＰＵ使用率Ｕの修正を行うことができる。例えば、サーバ負荷算出部３０は、サーバグループ管理リストＧＲＰを参照して、サーバグループ（Ｚ，Ｇ，Ｅ）の各々から少なくとも１つのサーバ１００を抽出することができる。また、負荷分散処理では、サーバ負荷が最大である「最大負荷サーバ」や、サーバ負荷が最小である「最小負荷サーバ」を見つけ出すことがとりわけ重要である。従って、サーバ１００−１〜１００−ｎの全てに関してＣＰＵ使用率Ｕの修正を行う必要は必ずしもない。ＣＰＵ使用率Ｕが最大あるいは最小であるサーバに着目し、そのサーバに関してＣＰＵ使用率Ｕの修正を行えば十分である場合もある。 Note that the serverload calculation unit 30 may or may not correct the CPU usage rate U for all of the servers 100-1 to 100-n. The serverload calculation unit 30 can extractseveral servers 100 from the servers 100-1 to 100-n and can correct the CPU usage rate U for each of the extractedservers 100. For example, the serverload calculation unit 30 can extract at least oneserver 100 from each of the server groups (Z, G, E) with reference to the server group management list GRP. In the load balancing process, it is particularly important to find the “maximum load server” having the maximum server load and the “minimum load server” having the minimum server load. Therefore, it is not always necessary to correct the CPU usage rate U for all of the servers 100-1 to 100-n. In some cases, it is sufficient to focus on the server having the maximum or minimum CPU usage rate U and to correct the CPU usage rate U for the server.

本実施の形態が負荷分散処理へ適用される場合、例えば次のようなサーバ抽出が好適である。サーバ負荷算出部３０は、ＣＰＵ使用率ＵとステップＳ２０での判定結果（サーバグループ管理リストＧＲＰ）に基づいて、サーバ１００−１〜１００−ｎからいくつかのサーバを抽出する。具体的には、サーバ負荷算出部３０は、サーバグループ管理リストＧＲＰで示されるサーバグループ（Ｚ，Ｇ，Ｅ）の各々から、ＣＰＵ使用率Ｕが最大であるサーバ及びＣＰＵ使用率Ｕが最小であるサーバのうち少なくとも一方を抽出する。ここでのＣＰＵ使用率Ｕは、上述の修正対象のＣＰＵ使用率Ｕと同じであり、性能指標変化テーブルＴＢＬあるいは図７Ｂで示されたサーバグループ管理リストＧＲＰを参照することにより得られる。そして、サーバ負荷算出部３０は、抽出された各サーバに関してＣＰＵ使用率Ｕを修正し、修正後ＣＰＵ使用率Ｕ’をサーバ負荷として算出する。 When the present embodiment is applied to load distribution processing, for example, the following server extraction is suitable. The serverload calculation unit 30 extracts several servers from the servers 100-1 to 100-n based on the CPU usage rate U and the determination result (server group management list GRP) in step S20. Specifically, the serverload calculation unit 30 determines that the server having the maximum CPU usage rate U and the CPU usage rate U are minimum from each of the server groups (Z, G, E) indicated by the server group management list GRP. Extract at least one of the servers. The CPU usage rate U here is the same as the CPU usage rate U to be corrected, and is obtained by referring to the performance index change table TBL or the server group management list GRP shown in FIG. 7B. Then, the serverload calculation unit 30 corrects the CPU usage rate U for each extracted server, and calculates the corrected CPU usage rate U ′ as the server load.

以上に説明されたステップＳ３０の結果、すなわち、算出された修正後ＣＰＵ使用率Ｕ’は、ディスプレイ（出力装置６）に表示されてもよい。ユーザは、その表示を参照することにより、後処理に伴う負荷が考慮された正確なサーバ負荷を把握することができる。また、ユーザは、入力装置５を用いて、修正後ＣＰＵ使用率Ｕ’（サーバ負荷）の分析を行うこともできる。 The result of step S30 described above, that is, the calculated post-correction CPU usage rate U 'may be displayed on the display (output device 6). By referring to the display, the user can grasp the accurate server load in consideration of the load associated with the post-processing. The user can also analyze the corrected CPU usage rate U ′ (server load) using theinput device 5.

ステップＳ４０：
サーバ選択部４０は、ステップＳ３０で抽出されたサーバ（修正後ＣＰＵ使用率Ｕ’が算出されたサーバ）から、所定の条件に合うサーバを選択する。例えば、サーバ選択部４０は、ステップＳ３０で抽出されたサーバから、「最大負荷サーバ」及び「最小負荷サーバ」の少なくとも１つを選択する。ここで、最大負荷サーバは、修正後ＣＰＵ使用率Ｕ’が最大であるサーバであり、最小負荷サーバは、修正後ＣＰＵ使用率Ｕ’が最小であるサーバである。最大負荷サーバが選択される場合、先行するステップＳ３０において、サーバグループ（Ｚ，Ｇ，Ｅ）の各々からＣＰＵ使用率Ｕが最大であるサーバが少なくとも抽出されていることが望ましい。最小負荷サーバが選択される場合、先行するステップＳ３０において、サーバグループ（Ｚ，Ｇ，Ｅ）の各々からＣＰＵ使用率Ｕが最小であるサーバが少なくとも抽出されていることが望ましい。Step S40:
Theserver selection unit 40 selects a server that meets a predetermined condition from the servers extracted in step S30 (the server for which the corrected CPU usage rate U ′ has been calculated). For example, theserver selection unit 40 selects at least one of “maximum load server” and “minimum load server” from the servers extracted in step S30. Here, the maximum load server is a server having the maximum corrected CPU usage rate U ′, and the minimum load server is a server having the minimum corrected CPU usage rate U ′. When the maximum load server is selected, it is desirable that at least the server having the maximum CPU usage rate U is extracted from each of the server groups (Z, G, E) in the preceding step S30. When the minimum load server is selected, in the preceding step S30, it is desirable that at least the server having the minimum CPU usage rate U is extracted from each of the server groups (Z, G, E).

ステップＳ４０において最大負荷サーバ及び／あるいは最小負荷サーバを選択することは、負荷分散処理に有用である。ステップＳ４０の結果、すなわち、選択されたサーバは、ディスプレイ（出力装置６）に表示されてもよい。ユーザは、その表示を参照することにより、最大負荷サーバや最小負荷サーバを把握することができる。 Selecting the maximum load server and / or the minimum load server in step S40 is useful for load distribution processing. The result of step S40, that is, the selected server may be displayed on the display (output device 6). The user can grasp the maximum load server and the minimum load server by referring to the display.

４．効果
以上に説明されたように、本実施の形態によれば、処理中リクエスト数Ｃが増加傾向か減少傾向かの判定が行われ、その判定結果に基づいてＣＰＵ使用率Ｕの修正が行われる。これにより、後処理に伴う負荷が、サーバ負荷の推定値に適切に組み込まれる。つまり、計算機システムにおいて後処理が発生する場合のサーバ負荷をより正確に算出することが可能となる。後処理を考慮することによってサーバ負荷が適切に算出されるため、最小負荷サーバ及び／あるいは最大負荷サーバを正しく決定することができる。結果として、計算機システムにおいて適切な負荷分散が可能となる。4). Effect As described above, according to the present embodiment, it is determined whether the number C of requests being processed is increasing or decreasing, and the CPU usage rate U is corrected based on the determination result. . Thereby, the load accompanying post-processing is appropriately incorporated in the estimated value of the server load. That is, it is possible to calculate the server load more accurately when post-processing occurs in the computer system. Since the server load is appropriately calculated by considering the post-processing, the minimum load server and / or the maximum load server can be correctly determined. As a result, appropriate load distribution can be performed in the computer system.

５．第１の応用例（リクエストの割り当て）
本実施の形態は、サーバへのリクエストの割り当て（振り分け）に応用することができる。図８は、既出のサーバ負荷判定処理に加えてリクエスト割当処理を行うための構成例を示している。既出の図２と同じ構成には同一の符号が付され、重複する説明は適宜省略される。5. First application example (request allocation)
This embodiment can be applied to allocation (distribution) of requests to servers. FIG. 8 shows a configuration example for performing a request allocation process in addition to the already-described server load determination process. The same components as those in FIG. 2 described above are denoted by the same reference numerals, and redundant description will be omitted as appropriate.

図８に示されるように、サーバ負荷管理システム１Ａは、新規リクエストＲＥＱを受け取り、その新規リクエストＲＥＱを複数のサーバ１００−１〜１００−ｎのうち適切な１台のサーバ１００に割り当てる。特に、サーバ負荷管理システム１Ａは、複数のサーバ１００−１〜１００−ｎのうち最小負荷サーバを選択し、選択された最小負荷サーバに新規リクエストＲＥＱを割り当てる。 As illustrated in FIG. 8, the server load management system 1A receives a new request REQ and assigns the new request REQ to an appropriate oneserver 100 among the plurality of servers 100-1 to 100-n. In particular, the server load management system 1A selects a minimum load server among the plurality of servers 100-1 to 100-n, and assigns a new request REQ to the selected minimum load server.

より詳細には、サーバ負荷管理システム１Ａの処理装置２は、サーバ負荷判定プログラムＰＲＯＧ１と共にリクエスト割当プログラムＰＲＯＧ２を実行する。リクエスト割当プログラムＰＲＯＧ２は、処理装置２によって実行されるソフトウェアプログラムである。典型的には、リクエスト割当プログラムＰＲＯＧ２は、コンピュータ読み取り可能な記録媒体に記録されている。処理装置２は、その記録媒体からリクエスト割当プログラムＰＲＯＧ２を読み出し、実行する。 More specifically, theprocessing device 2 of the server load management system 1A executes the request allocation program PROG2 together with the server load determination program PROG1. The request allocation program PROG2 is a software program executed by theprocessing device 2. Typically, the request assignment program PROG2 is recorded on a computer-readable recording medium. Theprocessing device 2 reads the request allocation program PROG2 from the recording medium and executes it.

図９は、本例に係る処理装置２の機能を示すブロック図である。図９に示されるように、処理装置２は、既出の性能指標蓄積部１０、傾向判定部２０、サーバ負荷算出部３０及びサーバ選択部４０に加えて、リクエスト割当部５０を有している。リクエスト割当部５０は、処理装置２とリクエスト割当プログラムＰＲＯＧ２との協働によって実現される。 FIG. 9 is a block diagram illustrating functions of theprocessing apparatus 2 according to the present example. As illustrated in FIG. 9, theprocessing device 2 includes arequest allocation unit 50 in addition to the performanceindex accumulation unit 10, thetrend determination unit 20, the serverload calculation unit 30, and theserver selection unit 40. Therequest allocation unit 50 is realized by cooperation between theprocessing device 2 and the request allocation program PROG2.

図１０は、本例に係るサーバ負荷管理システム１Ａによる処理を示すフローチャートである。ステップＳ１０〜Ｓ４０は、既出の実施の形態と同様である。但し、ステップＳ３０においてサーバ負荷算出部３０によって抽出されるサーバは、各サーバグループ（Ｚ，Ｇ，Ｅ）のうちＣＰＵ使用率Ｕが最小であるサーバを含んでいる。つまり、サーバ負荷算出部３０は、少なくとも、サーバグループＺからＣＰＵ使用率Ｕが最小であるサーバを抽出し、サーバグループＧからＣＰＵ使用率Ｕが最小であるサーバを抽出し、サーバグループＥからＣＰＵ使用率Ｕが最小であるサーバを抽出する。そして、サーバ負荷算出部３０は、抽出されたサーバの各々に関して修正後ＣＰＵ使用率Ｕ’を算出する。また、ステップＳ４０において、サーバ選択部４０は、上記抽出されたサーバから、修正後ＣＰＵ使用率Ｕ’が最小である最小負荷サーバを選択する。 FIG. 10 is a flowchart showing processing by the server load management system 1A according to this example. Steps S10 to S40 are the same as those in the above-described embodiment. However, the server extracted by the serverload calculation unit 30 in step S30 includes a server having a minimum CPU usage rate U among the server groups (Z, G, E). That is, the serverload calculation unit 30 extracts at least the server with the lowest CPU usage rate U from the server group Z, extracts the server with the lowest CPU usage rate U from the server group G, and extracts the CPU from the server group E with the CPU. The server with the lowest usage rate U is extracted. Then, the serverload calculation unit 30 calculates the corrected CPU usage rate U ′ for each of the extracted servers. In step S40, theserver selection unit 40 selects the minimum load server having the smallest corrected CPU usage rate U 'from the extracted servers.

ステップＳ５０において、リクエスト割当部５０は、新規リクエストＲＥＱを受け取る。そして、リクエスト割当部５０は、その新規リクエストＲＥＱを、ステップＳ４０において選択された最小負荷サーバに割り当てる。そして、リクエスト割当部５０は、新規リクエストＲＥＱをその最小負荷サーバに送信する。図８で示された例では、最小負荷サーバはサーバ１００−２である。ステップＳ４０において選択される最小負荷サーバは、時間経過とともに更新される。それに伴い、新規リクエストＲＥＱが送信される最小負荷サーバも変化する。結果として、複数のサーバ１００−１〜１００−ｎ間での負荷分散が実現される。 In step S50, therequest allocation unit 50 receives a new request REQ. Then, therequest allocation unit 50 allocates the new request REQ to the minimum load server selected in step S40. Then, therequest allocation unit 50 transmits a new request REQ to the minimum load server. In the example shown in FIG. 8, the minimum load server is the server 100-2. The minimum load server selected in step S40 is updated over time. Accordingly, the minimum load server to which the new request REQ is transmitted also changes. As a result, load distribution among the plurality of servers 100-1 to 100-n is realized.

本実施の形態によれば、後処理に伴う負荷を考慮して、最小負荷サーバが正しく選択される。これを利用することによって、後処理に伴う負荷をも考慮した適切な負荷分散が可能となる。尚、サーバ負荷判定処理（ステップＳ１０〜Ｓ４０）とリクエスト割当処理（ステップＳ５０）は、別々の装置（コンピュータ）により実行されてもよい。 According to the present embodiment, the minimum load server is correctly selected in consideration of the load accompanying post-processing. By using this, appropriate load distribution can be performed in consideration of the load accompanying post-processing. The server load determination process (steps S10 to S40) and the request allocation process (step S50) may be executed by separate devices (computers).

６．第２の応用例（サーバ間の処理の移行）
本実施の形態は、サーバ間の処理の移行に応用することもできる。図１１は、既出のサーバ負荷判定処理に加えて移行処理を行うための構成例を示している。既出の図２と同じ構成には同一の符号が付され、重複する説明は適宜省略される。6). Second application example (migration of processing between servers)
This embodiment can also be applied to the transfer of processing between servers. FIG. 11 shows a configuration example for performing the migration process in addition to the already-described server load determination process. The same components as those in FIG. 2 described above are denoted by the same reference numerals, and redundant description will be omitted as appropriate.

本例において、サーバ負荷管理システム１Ｂの処理装置２は、サーバ負荷判定プログラムＰＲＯＧ１と共に処理移行プログラムＰＲＯＧ３を実行する。処理移行プログラムＰＲＯＧ３は、処理装置２によって実行されるソフトウェアプログラムである。典型的には、処理移行プログラムＰＲＯＧ３は、コンピュータ読み取り可能な記録媒体に記録されている。処理装置２は、その記録媒体から処理移行プログラムＰＲＯＧ３を読み出し、実行する。 In this example, theprocessing device 2 of the server load management system 1B executes the process migration program PROG3 together with the server load determination program PROG1. The process migration program PROG3 is a software program executed by theprocessing device 2. Typically, the process transfer program PROG3 is recorded on a computer-readable recording medium. Theprocessing device 2 reads the processing migration program PROG3 from the recording medium and executes it.

図１２は、本例に係る処理装置２の機能を示すブロック図である。図１２に示されるように、処理装置２は、既出の性能指標蓄積部１０、傾向判定部２０、サーバ負荷算出部３０及びサーバ選択部４０に加えて、処理移行部６０を有している。処理移行部６０は、処理装置２と処理移行プログラムＰＲＯＧ３との協働によって実現される。 FIG. 12 is a block diagram illustrating functions of theprocessing apparatus 2 according to the present example. As illustrated in FIG. 12, theprocessing device 2 includes aprocess transition unit 60 in addition to the performanceindex accumulation unit 10, thetrend determination unit 20, the serverload calculation unit 30, and theserver selection unit 40. Theprocess transfer unit 60 is realized by cooperation between theprocessing device 2 and the process transfer program PROG3.

図１３は、本例に係るサーバ負荷管理システム１Ｂによる処理を示すフローチャートである。ステップＳ１０〜Ｓ４０は、既出の実施の形態と同様である。但し、ステップＳ３０においてサーバ負荷算出部３０によって抽出されるサーバは、各サーバグループ（Ｚ，Ｇ，Ｅ）のうちＣＰＵ使用率Ｕが最大であるサーバと最小であるサーバの両方を含んでいる。つまり、サーバ負荷算出部３０は、少なくとも、サーバグループＺからＣＰＵ使用率Ｕが最大であるサーバと最小であるサーバを抽出し、サーバグループＧからＣＰＵ使用率Ｕが最大であるサーバと最小であるサーバを抽出し、サーバグループＥからＣＰＵ使用率Ｕが最大であるサーバと最小であるサーバを抽出する。そして、サーバ負荷算出部３０は、抽出されたサーバの各々に関して修正後ＣＰＵ使用率Ｕ’を算出する。また、ステップＳ４０において、サーバ選択部４０は、上記抽出されたサーバから、修正後ＣＰＵ使用率Ｕ’が最大である最大負荷サーバ（第１サーバ）と、修正後ＣＰＵ使用率Ｕ’が最小である最小負荷サーバ（第２サーバ）の両方を選択する。 FIG. 13 is a flowchart showing processing by the server load management system 1B according to the present example. Steps S10 to S40 are the same as those in the above-described embodiment. However, the server extracted by the serverload calculation unit 30 in step S30 includes both the server having the maximum CPU usage rate U and the server having the minimum CPU usage rate U among the server groups (Z, G, E). In other words, the serverload calculation unit 30 extracts at least the server with the largest CPU usage rate U and the smallest server from the server group Z, and the server with the smallest CPU usage rate U from the server group G. A server is extracted, and a server having the maximum CPU usage rate U and a server having the minimum CPU usage rate U are extracted from the server group E. Then, the serverload calculation unit 30 calculates the corrected CPU usage rate U ′ for each of the extracted servers. In step S40, theserver selection unit 40 determines that the maximum load server (first server) having the highest corrected CPU usage rate U ′ and the lowest corrected CPU usage rate U ′ from the extracted servers. Select both minimum load servers (second servers).

ステップＳ６０において、処理移行部６０は、最大負荷サーバで実行中の処理のいくつかを、最小負荷サーバに移行（移動）する。このような移行処理は、例えば、特開２００４−０７８４６５号公報に記載された「プロセスマイグレーション方法」により実現可能である。図１１で示された例では、最大負荷サーバはサーバ１００−１であり、最小負荷サーバはサーバ１００−２である。よって、処理移行部６０は、サーバ１００−１で実行中の処理のいくつかを、サーバ１００−２に移行する。ステップＳ４０において選択される最大負荷サーバ及び最小負荷サーバは、時間経過とともに更新される。それに伴い、移行が行われる最大負荷サーバ及び最小負荷サーバも変化する。結果として、複数のサーバ１００−１〜１００−ｎ間での負荷分散が実現される。 In step S60, theprocess migration unit 60 migrates (moves) some of the processes being executed on the maximum load server to the minimum load server. Such a migration process can be realized by, for example, a “process migration method” described in JP-A-2004-078465. In the example shown in FIG. 11, the maximum load server is the server 100-1, and the minimum load server is the server 100-2. Therefore, theprocess transfer unit 60 transfers some of the processes being executed on the server 100-1 to the server 100-2. The maximum load server and the minimum load server selected in step S40 are updated over time. Accordingly, the maximum load server and the minimum load server to which the migration is performed also change. As a result, load distribution among the plurality of servers 100-1 to 100-n is realized.

本実施の形態によれば、後処理に伴う負荷を考慮して、最大負荷サーバ及び最小負荷サーバが正しく選択される。これを利用することによって、後処理に伴う負荷をも考慮した適切な負荷分散が可能となる。尚、サーバ負荷判定処理（ステップＳ１０〜Ｓ４０）と移行処理（ステップＳ６０）は、別々の装置（コンピュータ）により実行されてもよい。 According to the present embodiment, the maximum load server and the minimum load server are correctly selected in consideration of the load accompanying post-processing. By using this, appropriate load distribution can be performed in consideration of the load accompanying post-processing. The server load determination process (steps S10 to S40) and the migration process (step S60) may be executed by separate devices (computers).

以上、本発明の実施の形態が添付の図面を参照することにより説明された。但し、本発明は、上述の実施の形態に限定されず、要旨を逸脱しない範囲で当業者により適宜変更され得る。 The embodiments of the present invention have been described above with reference to the accompanying drawings. However, the present invention is not limited to the above-described embodiments, and can be appropriately changed by those skilled in the art without departing from the scope of the invention.

図１Ａは、ＣＰＵ使用率と処理中リクエスト数との関係を示すグラフである。FIG. 1A is a graph showing the relationship between the CPU usage rate and the number of requests being processed.図１Ｂは、ＣＰＵ使用率と単位時間あたりのリクエスト到着数との関係を示すグラフである。FIG. 1B is a graph showing the relationship between the CPU usage rate and the number of request arrivals per unit time.図２は、本発明の実施の形態に係るサーバ負荷管理システムの構成例を示すブロック図である。FIG. 2 is a block diagram showing a configuration example of the server load management system according to the embodiment of the present invention.図３は、本実施の形態に係る処理装置の機能を示すブロック図である。FIG. 3 is a block diagram illustrating functions of the processing apparatus according to the present embodiment.図４は、本実施の形態に係るサーバ負荷管理システムによる処理を示すフローチャートである。FIG. 4 is a flowchart showing processing by the server load management system according to the present embodiment.図５Ａは、本実施の形態に係る性能指標変化テーブルを示す概念図である。FIG. 5A is a conceptual diagram showing a performance index change table according to the present embodiment.図５Ｂは、本実施の形態に係る性能指標変化テーブルの一例を示している。FIG. 5B shows an example of the performance index change table according to the present embodiment.図６は、本実施の形態におけるステップＳ２０の処理の一例を示すフローチャートである。FIG. 6 is a flowchart showing an example of the process of step S20 in the present embodiment.図７Ａは、本実施の形態に係るサーバグループ管理リストの一例を示す概念図である。FIG. 7A is a conceptual diagram showing an example of a server group management list according to the present embodiment.図７Ｂは、本実施の形態に係るサーバグループ管理リストの他の例を示す概念図である。FIG. 7B is a conceptual diagram showing another example of the server group management list according to the present embodiment.図８は、本実施の形態の第１の応用例に係るサーバ負荷管理システムの構成例を示すブロック図である。FIG. 8 is a block diagram illustrating a configuration example of the server load management system according to the first application example of the present embodiment.図９は、第１の応用例に係る処理装置の機能を示すブロック図である。FIG. 9 is a block diagram illustrating functions of the processing apparatus according to the first application example.図１０は、第１の応用例に係るサーバ負荷管理システムによる処理を示すフローチャートである。FIG. 10 is a flowchart showing processing by the server load management system according to the first application example.図１１は、本実施の形態の第２の応用例に係るサーバ負荷管理システムの構成例を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration example of a server load management system according to a second application example of the present embodiment.図１２は、第２の応用例に係る処理装置の機能を示すブロック図である。FIG. 12 is a block diagram illustrating functions of the processing apparatus according to the second application example.図１３は、第２の応用例に係るサーバ負荷管理システムによる処理を示すフローチャートである。FIG. 13 is a flowchart illustrating processing by the server load management system according to the second application example.

符号の説明Explanation of symbols

１，１Ａ，１Ｂサーバ負荷管理システム
２処理装置
３記憶装置
４通信装置
５入力装置
６出力装置
１０性能指標蓄積部
２０傾向判定部
３０サーバ負荷算出部
４０サーバ選択部
５０リクエスト割当部
６０処理移行部
１００サーバ
１１０監視部
ＰＩ性能指標データ
ＲＩリクエストインデックス
ＴＢＬ性能指標変化テーブル
ＧＲＰサーバグループ管理リスト
ＰＲＯＧ１サーバ負荷判定プログラム
ＰＲＯＧ２リクエスト割当プログラム
ＰＲＯＧ３処理移行プログラム1, 1A, 1B Serverload management system 2Processing device 3Storage device 4Communication device 5 Input device 6Output device 10 Performanceindex storage unit 20Trend determination unit 30 Serverload calculation unit 40Server selection unit 50Request allocation unit 60Process transfer unit 100server 110 monitoring unit PI performance index data RI request index TBL performance index change table GRP server group management list PROG1 server load determination program PROG2 request allocation program PROG3 process migration program