Movatterモバイル変換

RFC 9411	Benchmarking Network Security Devices	March 2023
Balarajah, et al.	Informational	[Page]

4.Test Setup

The test setup defined in this document applies to all benchmarking tests described inSection 7. The test setupMUST be contained within an isolated test environment (seeSection 3 of [RFC6815]).¶

Testbed configurationMUST ensure that any performance implications that are discovered during the benchmark testing aren't due to the inherent physical network limitations, such as the number of physical links and forwarding performance capabilities (throughput and latency) of the network devices in the testbed. For this reason, this document recommends avoiding external devices, such as switches and routers, in the testbed wherever possible.¶

In some deployment scenarios, the network security devices (DUT/SUT) are connected to routers and switches, which will reduce the number of entries in MAC (Media Access Control) or Address Resolution Protocol / Neighbor Discovery (ARP/ND) tables of the DUT/SUT. If MAC or ARP/ND tables have many entries, this may impact the actual DUT/SUT performance due to MAC and ARP/ND table lookup processes. This document also recommends using test equipment with the capability of emulating layer 3 routing functionality instead of adding external routers in the testbed.¶

The testbed setup for Option 1 (Figure 1) is theRECOMMENDED testbed setup for the benchmarking test.¶

+-----------------------+                   +-----------------------+| +-------------------+ |   +-----------+   | +-------------------+ || | Emulated Router(s)| |   |           |   | | Emulated Router(s)| || |    (Optional)     | +----- DUT/SUT  +-----+    (Optional)     | || +-------------------+ |   |           |   | +-------------------+ || +-------------------+ |   +-----------+   | +-------------------+ || |     Clients       | |                   | |      Servers      | || +-------------------+ |                   | +-------------------+ ||                       |                   |                       ||   Test Equipment      |                   |   Test Equipment      |+-----------------------+                   +-----------------------+

Figure 1:Testbed Setup - Option 1

If the test equipment used is not capable of emulating OSI layer 3 routing functionality or if the number of used ports is mismatched between the test equipment and the DUT/SUT (which is needed for test equipment port aggregation), the test setup can be configured as shown inFigure 2.¶

 +-------------------+      +-----------+      +--------------------+ |Aggregation Switch/|      |           |      | Aggregation Switch/| | Router            +------+  DUT/SUT  +------+ Router             | |                   |      |           |      |                    | +----------+--------+      +-----------+      +--------+-----------+            |                                           |            |                                           |+-----------+-----------+                   +-----------+-----------+|                       |                   |                       || +-------------------+ |                   | +-------------------+ || | Emulated Router(s)| |                   | | Emulated Router(s)| || |     (Optional)    | |                   | |     (Optional)    | || +-------------------+ |                   | +-------------------+ || +-------------------+ |                   | +-------------------+ || |      Clients      | |                   | |      Servers      | || +-------------------+ |                   | +-------------------+ ||                       |                   |                       ||    Test Equipment     |                   |    Test Equipment     |+-----------------------+                   +-----------------------+

Figure 2:Testbed Setup - Option 2

4.2.DUT/SUT Configuration

The same DUT/SUT configurationMUST be used for all benchmarking tests described inSection 7. Since each DUT/SUT will have its own unique configuration, usersMUST configure their devices with the same parameters and security features that would be used in the actual deployment of the device or a typical deployment. The DUT/SUTMUST be configured in "Inline" mode so that the traffic is actively inspected by the DUT/SUT.¶

Tables2 and3 below describe theRECOMMENDED andOPTIONAL sets of network security features for NGFWs and NGIPSs, respectively. If the recommended security features are not enabled in the DUT/SUT for any reason, the reasonMUST be reported with the benchmarking test results. For example, one reason for not enabling the anti-virus feature in an NGFW may be that this security feature was not required for a particular customer deployment scenario. ItMUST be also noted in the benchmarking test report that not enabling the specific recommended security features may impact the performance of the DUT/SUT. The selected security featuresMUST be consistently enabled on the DUT/SUT for all benchmarking tests described inSection 7.¶

To improve repeatability, a summary of the DUT/SUT configuration, including a description of all enabled DUT/SUT features,MUST be published with the benchmarking results.¶

The following table provides a brief description of the security feature; these are approximate taxonomies of features commonly found in currently deployed NGFWs and NGIPSs. The features provided by specific implementations may be named differently and not necessarily have configuration settings that align with the taxonomy.¶

Table 1:Security Feature Description
DUT/SUT Features	Description
TLS Inspection	The DUT/SUT intercepts and decrypts inbound HTTPS traffic between servers and clients. Once the content inspection has been completed, the DUT/SUT encrypts the HTTPS traffic with ciphers and keys used by the clients and servers. For TLS 1.3, the DUT works as a middlebox (proxy) and holds the certificates and Pre-Shared Keys (PSKs) that are trusted by the client and represent the identity of the real server.
IDS/IPS	The DUT/SUT detects and blocks exploits targeting known and unknown vulnerabilities across the monitored network.
Anti-Malware	The DUT/SUT detects and prevents the transmission of malicious executable code and any associated communications across the monitored network. This includes data exfiltration as well as command and control channels.
Anti-Spyware	Anti-Spyware is a subcategory of Anti-Malware. Spyware transmits information without the user's knowledge or permission. The DUT/SUT detects and blocks the initial infection or transmission of data.
Anti-Botnet	The DUT/SUT detects and blocks traffic to or from botnets.
Anti-Evasion	The DUT/SUT detects and mitigates attacks that have been obfuscated in some manner.
Web Filtering	The DUT/SUT detects and blocks malicious websites, including defined classifications of websites across the monitored network.
Data Loss Protection (DLP)	The DUT/SUT detects and prevents data breaches and data exfiltration, or it detects and blocks the transmission of sensitive data across the monitored network.
Certificate Validation	The DUT/SUT validates certificates used in encrypted communications across the monitored network.
Logging and Reporting	The DUT/SUT logs and reports all traffic at the flow level across the monitored network.
Application Identification	The DUT/SUT detects known applications as defined within the traffic mix selected across the monitored network.
Deep Packet Inspection (DPI)	The DUT/SUT inspects the content of the data packet.

Table 2:NGFW Security Features
DUT/SUT (NGFW) Features	RECOMMENDED	OPTIONAL
TLS Inspection	x
IDS/IPS	x
Anti-Spyware	x
Anti-Virus	x
Anti-Botnet	x
Anti-Evasion	x
Web Filtering		x
Data Loss Protection (DLP)		x
DDoS Protection		x
Certificate Validation		x
Application Identification	x

Table 3:NGIPS Security Features
DUT/SUT (NGIPS) Features	RECOMMENDED	OPTIONAL
TLS Inspection	x
Anti-Malware	x
Anti-Spyware	x
Anti-Botnet	x
Application Identification	x
Deep Packet Inspection (DPI)	x
Anti-Evasion	x

Note: With respect to TLS Inspection, there are scenarios where it will be optional.¶

Below is a summary of the DUT/SUT configuration:¶

The DUT/SUTMUST be configured in "Inline" mode.¶
"Fail-Open" behaviorMUST be disabled.¶
AllRECOMMENDED security features are enabled.¶
Logging and reportingMUST be enabled. The DUT/SUTSHOULD log all traffic at the flow level (5-tuple). If the DUT/SUT is designed to log all traffic at different levels (e.g., IP packet levels), it is acceptable to conduct tests. However, thisMUST be noted in the test report. Logging to an external device is permissible.¶
Geographical location filteringSHOULD be configured. If the DUT/SUT is not designed to perform geographical location filtering, it is acceptable to conduct tests without this feature. However, thisMUST be noted in the test report.¶
Application Identification and ControlMUST be configured to trigger applications from the defined traffic mix.¶

In addition, a realistic number of access control lists (ACLs)SHOULD be configured on the DUT/SUT where ACLs are configurable and reasonable based on the deployment scenario. For example, it is acceptable not to configure ACLs in an NGIPS since NGIPS devices do not require the use of ACLs in most deployment scenarios. This document determines the number of access policy rules for four different classes of the DUT/SUT: Extra Small (XS), Small (S), Medium (M), and Large (L). A sample DUT/SUT classification is described inAppendix B.¶

The ACLs defined inTable 4MUST be configured from top to bottom in the correct order, as shown in the table. This is due to ACL types listed in specificity-decreasing order, with "block" first, followed by "allow", representing a typical ACL-based security policy. The ACL entriesMUST be configured with routable IP prefixes by the DUT/SUT, where applicable. (Note: There will be differences between how security vendors implement ACL decision making.) The configured ACLMUST NOT block the test traffic used for the benchmarking tests.¶

Table 4:DUT/SUT Access List
				DUT/SUT Classification # Rules
Rules Type	Match Criteria	Description	Action	XS	S	M	L
Application layer	Application	Any application not included in the measurement traffic	block	5	10	20	50
Transport layer	SRC IP and TCP/UDP DST ports	Any SRC IP prefix used and any DST ports not used in the measurement traffic	block	25	50	100	250
IP layer	SRC/DST IP	Any SRC/DST IP subnet not used in the measurement traffic	block	25	50	100	250
Application layer	Application	Half of the applications included in the measurement traffic (see the note below)	allow	10	10	10	10
Transport layer	SRC IP and TCP/UDP DST ports	Half of the SRC IPs used and any DST ports used in the measurement traffic (one rule per subnet)	allow	>1	>1	>1	>1
IP layer	SRC IP	The rest of the SRC IP prefix range used in the measurement traffic (one rule per subnet)	allow	>1	>1	>1	>1

Note 1: Based on the test customer's specific use case, the testers can increase the number of rules.¶

Note 2: If half of the applications included in the test traffic are less than 10, the missing number of ACL entries (placeholder rules) can be configured for any application traffic not included in the test traffic.¶

Note 3: In the event that the DUT/SUT is designed to not use ACLs, it is acceptable to conduct tests without them. However, thisMUST be noted in the test report.¶

4.2.1.Security Effectiveness Configuration

The selected security features (defined in Tables2 and3) of the DUT/SUTMUST be configured effectively to detect, prevent, and report the defined security vulnerability sets. This section defines the selection of the security vulnerability sets from the Common Vulnerabilities and Exposures (CVEs) list[CVE] for testing. The vulnerability set should reflect a minimum of 500 CVEs from no older than 10 calendar years to the current year. These CVEs should be selected with a focus on in-use software commonly found in business applications, with a Common Vulnerability Scoring System (CVSS) Severity of High (7-10).¶

This document is primarily focused on performance benchmarking. However, it isRECOMMENDED to validate the security features configuration of the DUT/SUT by evaluating the security effectiveness as a prerequisite for performance benchmarking tests defined inSection 7. In case the benchmarking tests are performed without evaluating security effectiveness, the test reportMUST explain the implications of this. The methodology for evaluating security effectiveness is defined inAppendix A.¶

4.3.Test Equipment Configuration

In general, test equipment allows configuring parameters in different protocol layers. Extensive proof-of-concept tests conducted to support preparation of this document showed that benchmarking results are strongly affected by the choice of protocol stack parameters, especially OSI layer 4 transport protocol parameters. For more information on how TCP and QUIC parameters will impact performance, review[fastly]. To achieve reproducible results that will be representative of real deployment scenarios, careful specification and documentation of the parameters are required.¶

This section specifies common test equipment configuration parameters applicable for all benchmarking tests defined inSection 7. Any benchmarking-test-specific parameters are described under the test setup section of each benchmarking test individually.¶

4.3.1.Client Configuration

This section specifies which parameters should be considered while configuring emulated client endpoints in the test equipment. Also, this section specifies theRECOMMENDED values for certain parameters. The values are the defaults typically used in most of the client operating system types.¶

Pre-standard evaluations have shown that it is possible to set a wide range of arbitrary parameters for OSI layer 4 transport protocols on test equipment leading to optimization of client-specific results; however, only well-defined common parameter sets help to establish meaningful and comparable benchmarking results. For these reasons, this document recommends specific sets of transport protocol parameters to be configured on test equipment used for benchmarking.¶

4.3.1.1.TCP Stack Attributes

The TCP stack of the emulated client endpointsMUST fulfill the TCP requirements defined inAppendix B of [RFC9293]. In addition, this section specifies theRECOMMENDED values for TCP parameters configured using the parameters described below.¶

The IPv4 and IPv6 Maximum Segment Sizes (MSSs) are set to 1460 bytes and 1440 bytes, respectively. TX and RX initial receive window sizes are set to 65535 bytes. The client's initial congestion window should not exceed 10 times the MSS. Delayed ACKs are permitted, and the maximum client delayed ACK should not exceed 10 times of the MSS before a forced ACK; also, the maximum delayed ACK timer is allowed to be set to 200 ms. Up to three retries are allowed before a timeout event is declared. The TCP PSH flag is set to high in all traffic. The source port range is 1024-65535. The clients initiate TCP connections via a three-way handshake (SYN, SYN/ACK, ACK) and close TCP connections via either a TCP three-way close (FIN, FIN/ACK, ACK) or a TCP four-way close (FIN, ACK, FIN, ACK).¶

4.3.1.2.QUIC Specification

QUIC stack emulation on the test equipmentMUST conform to[RFC9000] and[RFC9001]. This section specifies theRECOMMENDED values for certain QUIC parameters to be configured on test equipment used for benchmarking purposes only. The QUIC stream type (defined inSection 2.1 of [RFC9000]) is set to "Client-Initiated, Bidirectional". 0-RTT and early data are disabled. The QUIC connection termination method is an immediate close (Section 10.2 of [RFC9000]). Flow control is enabled. UDP payloads are set to the datagram size of 1232 bytes for IPv6 and 1252 bytes for IPv4. In addition, transport parameters and default values defined inSection 18.2 of [RFC9000] areRECOMMENDED to configure on test equipment. Also, this document references AppendicesB.1 andB.2 of[RFC9002] for congestion-control-related constants and variables. Any configured QUIC and UDP parameterMUST be documented in the test report.¶

4.3.1.3.Client IP Address Space

The client IP space contains the following attributes.¶

If multiple IP blocks are used, theyMUST consist of multiple unique, discontinuous static address blocks.¶
A default gatewayMAY be used.¶
The differentiated services code point (DSCP) marking should be set to Default Forwarding (DF) '000000' on the IPv4 Type of Service (ToS) field and IPv6 Traffic Class field.¶
One or more extension headersMAY be used for IPv6 clients. If multiple extension headers are needed for traffic emulation, this document references[RFC8200] to choose the correct order of the extension headers within an IPv6 packet. Testing with one or more extension headers may impact the performance of the DUT. The extension headersMUST be documented and reported.¶

The following equation can be used to define the total number of client IP addresses that need to be configured on the test equipment.¶

Desired total number of client IP addresses = Target throughput [Mbit/s] / Average throughput per IP address [Mbit/s]¶

As shown in the example list below, the value for "Average throughput per IP address" can be varied depending on the deployment and use case scenario.¶

Example 1: DUT/SUT deployment scenario 1: 6-7 Mbit/s per IP (e.g., 1,400-1,700 IPs per 10 Gbit/s of throughput)¶
Example 2: DUT/SUT deployment scenario 2: 0.1-0.2 Mbit/s per IP (e.g., 50,000-100,000 IPs per 10 Gbit/s of throughput)¶

Client IP addressesMUST be distributed between IPv4 and IPv6 based on the deployment and use case scenario. The following optionsMAY be considered for a selection of ratios for both IP addresses and traffic load distribution.¶

Option 1: 100 % IPv4, no IPv6¶
Option 2: 80 % IPv4, 20% IPv6¶
Option 3: 50 % IPv4, 50% IPv6¶
Option 4: 20 % IPv4, 80% IPv6¶
Option 5: no IPv4, 100% IPv6¶

Note: IANA has assigned IP address ranges for testing purposes, as described inSection 8. If the test scenario requires more IP addresses or subnets than IANA has assigned, this document recommends using private IPv4 address ranges or Unique Local Address (ULA) IPv6 address ranges for the testing.¶

4.3.1.4.Emulated Web Browser Attributes

The client (emulated web browser) contains attributes that will materially affect the traffic load. The objective is to emulate modern, typical browser attributes to improve the relevance of the result set for typical deployment scenarios.¶

The emulated browserMUST negotiate HTTP version 1.1 or higher. The emulated browserSHOULD advertise a User-Agent header. The emulated browserMUST enforce content length validation. HTTP header compressionMAY be set to enable. If HTTP header compression is configurable in the test equipment, itMUST be documented if it was enabled or disabled. Depending on test scenarios and the chosen HTTP version, the emulated browserMAY open multiple TCP or QUIC connections per server endpoint IP at any time, depending on how many sequential transactions need to be processed.¶

For HTTP/2 traffic emulation, the emulated browser opens multiple concurrent streams per connection (multiplexing). For HTTPS requests, the emulated browserMUST send an "h2" protocol identifier using the TLS extension Application-Layer Protocol Negotiation (ALPN). The following default values (see[Undertow]) are theRECOMMENDED settings for certain HTTP/2 parameters to be configured on test equipment used for benchmarking purposes only:¶

Maximum frame size: 16384 bytes¶
Initial window size: 65535 bytes¶
HPACK header field table size: 4096 bytes¶
Server push enable: false (Note: In[Undertow], the default setting is true. However, for testing purposes, this document recommends setting the value to false for server push.)¶

This document refers to[RFC9113] for further details of HTTP/2. If any additional parameters are used to configure the test equipment, theyMUST be documented.¶

For HTTP/3 traffic emulation, the emulated browsers initiate secure QUIC connections using TLS 1.3 ([RFC9001] describes how TLS is used to secure QUIC). This document refers to[RFC9114] for HTTP/3 specifications. The specification for transport protocol parameters is defined inSection 4.3.1.2. QPACK configuration settings, such as MAX_TABLE_CAPACITY and QPACK_BLOCKED_STREAMS, are set to zero (default), as defined in[RFC9204]. Any HTTP/3 parameters used for test equipment configurationMUST be documented.¶

For encrypted traffic, the following attributes are defined as the negotiated encryption parameters. The test clientsMUST use TLS version 1.2 or higher. The TLS record sizeMAY be optimized for the HTTPS response object size, up to a record size of 16 KB. If Server Name Indication (SNI) is required (especially if the server is identified by a domain name), the client endpointMUST send TLS extension SNI information when opening a security tunnel. Each client connectionMUST perform a full TLS handshake, and session reuse or resumptionMUST be disabled. (Note: Real web browsers use session reuse or resumption. However, for testing purposes, this feature must not be used to measure the DUT/SUT performance in the worst-case scenario.)¶

The following ciphers and keys supported by TLS 1.2 areRECOMMENDED for the HTTPS-based benchmarking tests defined inSection 7.¶

ECDHE-ECDSA-AES128-GCM-SHA256 with Prime256v1 (Signature Hash Algorithm: ecdsa_secp256r1_sha256 and Supported group: secp256r1)¶
ECDHE-RSA-AES128-GCM-SHA256 with RSA 2048 (Signature Hash Algorithm: rsa_pkcs1_sha256 and Supported group: secp256r1)¶
ECDHE-ECDSA-AES256-GCM-SHA384 with Secp384r1 (Signature Hash Algorithm: ecdsa_secp384r1_sha384 and Supported group: secp384r1)¶
ECDHE-RSA-AES256-GCM-SHA384 with RSA 4096 (Signature Hash Algorithm: rsa_pkcs1_sha384 and Supported group: secp384r1)¶

Note: The above ciphers and keys were those commonly used for enterprise-grade encryption cipher suites for TLS 1.2 at of the time of publication (2023). Individual certification bodies should use ciphers and keys that reflect evolving use cases. These choicesMUST be documented in the resulting test reports with detailed information on the ciphers and keys used, along with reasons for the choices.¶

IANA recommends the following cipher suites for use with TLS 1.3, as defined in[RFC8446].¶

TLS_AES_128_GCM_SHA256¶
TLS_AES_256_GCM_SHA384¶
TLS_CHACHA20_POLY1305_SHA256¶
TLS_AES_128_CCM_SHA256¶

4.3.2.Backend Server Configuration

This section specifies which parameters should be considered while configuring emulated backend servers using test equipment.¶

4.3.2.1.TCP Stack Attributes

The TCP stack on the server-sideMUST be configured similarly to the client-side configuration described inSection 4.3.1.1.¶

4.3.2.2.QUIC Specification

The QUIC parameters on the server-sideMUST be configured similarly to the client-side configuration. Any configured QUIC parameterMUST be documented in the report.¶

4.3.2.3.Server Endpoint IP Addressing

The sum of the server IP spaceMUST contain the following attributes.¶

The server IP blocksMUST consist of unique, discontinuous static address blocks with one IP per server Fully Qualified Domain Name (FQDN) endpoint per test port.¶
A default gateway is permitted. The DSCP marking is set to DF '000000' on the IPv4 ToS field and IPv6 Traffic Class field. One or more extension headers for the IPv6 server are permitted. If multiple extension headers are required, this document references[RFC8200] to choose the correct order of the extension headers within an IPv6 packet.¶
The server IP address distribution between IPv4 and IPv6MUST be identical to the client IP address distribution ratio.¶

Note: IANA has assigned IP address blocks for the testing purpose described inSection 8. If the test scenario requires more IP addresses or address blocks than IANA has assigned, this document recommends using private IPv4 address ranges or Unique Local Address (ULA) IPv6 address ranges for the testing.¶

4.3.2.4.HTTP/HTTPS Server Pool Endpoint Attributes

The HTTP 1.1 and HTTP/2 server pools listen on TCP ports 80 and 443 for HTTP and HTTPS. The HTTP/3 server pool listens on any UDP port. The serverMUST emulate the same HTTP version (HTTP 1.1, HTTP/2, or HTTP/3) and settings chosen by the client (emulated web browser). For the HTTPS server, TLS version 1.2 or higherMUST be used with a maximum record size of 16 KB. Ticket resumption or session ID reuseMUST NOT be used for TLS 1.2; also, session ticket or session cacheMUST NOT be used for TLS 1.3. The serverMUST serve a certificate to the client. The cipher suite and key size on the server-sideMUST be configured similarly to the client-side configuration described inSection 4.3.1.4.¶

4.3.3.Traffic Flow Definition

At the beginning of the test (the init phase; seeSection 4.3.4), the server endpoint initializes, and the server endpoint will be ready to accept TCP or QUIC connections as well as inbound HTTP and HTTPS requests. The client endpoints initialize and are given attributes such as a MAC and IP address. After the init phase of the test, each client sweeps through the given server IP space, generating a service recognizable by the DUT. Sequential and pseudorandom sweep methods are acceptable. The method usedMUST be stated in the final report. Thus, a balanced mesh between client endpoints and server endpoints will be generated in a client IP and port to server IP and port combination. Each client endpoint performs the same actions as other endpoints, with the difference being the source IP of the client endpoint and the target server IP pool. The clientMUST use the server IP address or FQDN in the host header.¶

4.3.3.1.Description of Intra-Client Behavior

Client endpoints are independent of other clients that are concurrently executing. When a client endpoint initiates traffic, this section describes how the client steps through different services. Once the test is initialized, the client endpoints randomly hold (perform no operation) for a few milliseconds for better randomization of the start of client traffic. Each client (HTTP 1.1 or HTTP/2) will either open a new TCP connection or connect to an HTTP persistent connection that is still open to that specific server. HTTP/3 clients will open UDP streams within QUIC connections. At any point that the traffic profile may require encryption, a TLS encryption tunnel will form, presenting the URL or IP address request to the server. If using SNI, the serverMUST then perform an SNI name check by comparing the proposed FQDN to the domain embedded in the certificate. Only when correct will the server process the HTTPS response object. The initial response object to the server is based on benchmarking tests described inSection 7. Multiple additional sub-URLs (response objects on the service page)MAY be requested simultaneously. ThisMAY be to the same server IP as the initial URL. Each sub-object will also use a canonical FQDN and URL path.¶

4.3.4.Traffic Load Profile

The loading of traffic is described in this section. The loading of a traffic load profile has five phases: Init, ramp up, sustain, ramp down, and collection.¶

Init phase:: Testbed devices, including the client and server endpoints, should negotiate layer 2-3 connectivity, such as MAC learning and ARP/ND. Only after successful MAC learning or ARP/NDSHALL the test iteration move to the next phase. No measurements are made in this phase. The minimum recommended time for the Init phase is 5 seconds. During this phase, the emulated clientsMUST NOT initiate any sessions with the DUT/SUT; in contrast, the emulated servers should be ready to accept requests from the DUT/SUT or emulated clients.¶
Ramp Up phase:: The test equipmentMUST start to generate the test traffic. ItMUST use a set of the approximate number of unique client IP addresses to generate traffic. The trafficMUST ramp up from zero to the desired target objective. The target objective is defined for each benchmarking test. The duration for the ramp up phaseMUST be configured long enough that the test equipment does not overwhelm the DUT's/SUT's stated performance metrics defined inSection 6.3, namely TCP or QUIC connections per second, inspected throughput, concurrent TCP or QUIC connections, and application transactions per second. No measurements are made in this phase.¶
Sustain phase:: This phase starts when all required clients are active and operating at their desired load condition. In the sustain phase, the test equipmentMUST continue generating traffic to a constant target value for a constant number of active clients. The minimumRECOMMENDED time duration for the sustain phase is 300 seconds. This is the phase where measurements occur. The test equipmentMUST measure and record statistics continuously. The sampling interval for collecting the raw results and calculating the statisticsMUST be less than 2 seconds.¶
Ramp Down phase:: The test traffic slows down from the target number to 0, and no measurements are made.¶
Collection phase:: The last phase is administrative and will occur when the test equipment merges and collates the report data.¶

7.Benchmarking Tests

This section mainly focuses on the benchmarking tests with HTTP/1.1 or HTTP/2 traffic, which uses TCP as the transport protocol. In particular, this section does not define specific benchmarking tests for KPIs related to QUIC or HTTP/3. However, the test methodology defined in the benchmarking testsTCP or QUIC connections per second with HTTPS traffic (Section 7.6),HTTPS transaction latency (Section 7.8),HTTPS throughput (Section 7.7), andconcurrent TCP or QUIC connection capacity with HTTPS traffic (Section 7.9) can be used to test KPIs related to QUIC or HTTP/3. The throughput performance test with the application traffic mix defined inSection 7.1 can be performed with any other application traffic, including HTTP/3.¶

7.1.Throughput Performance with Application Traffic Mix

7.1.1.Objective

Using a relevant application traffic mix, determine the sustainable inspected throughput supported by the DUT/SUT.¶

Based on the test customer's specific use case, testers can choose the relevant application traffic mix for this test. The details about the traffic mixMUST be documented in the report. At least, the following traffic mix detailsMUST be documented and reported together with the test results:¶

Name of applications and layer 7 protocols¶
Percentage of emulated traffic for each application and layer 7 protocol¶
Percentage of encrypted traffic and used cipher suites and keys (theRECOMMENDED ciphers and keys are defined inSection 4.3.1.4)¶
Used object sizes for each application and layer 7 protocols¶

7.1.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any benchmarking-test-specific testbed configuration changesMUST be documented.¶

7.1.3.Test Parameters

In this section, the benchmarking-test-specific parameters are defined.¶

7.1.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented. If the DUT/SUT is configured without TLS inspection, the test reportMUST explain how this impacts the encrypted traffic of the relevant application traffic mix.¶

7.1.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target inspected throughput: Aggregated line rate of one or more interfaces used in the DUT/SUT or the value defined based on the requirement for a specific deployment scenario¶
Initial throughput: 10% of the "Target inspected throughput"¶
Note: Initial throughput is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.1.4.¶
One of the ciphers and keys defined inSection 4.3.1.4 isRECOMMENDED to use for this benchmarking test.¶

7.1.3.3.Traffic Profile

This testMUST be run with a relevant application traffic mix profile.¶

7.1.3.4.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactionsMUST be less than 0.001% (1 out of 100,000 transactions) of the attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections.¶
If HTTP/3 is used, the number of failed QUIC connections due to unexpected HTTP/3 error codesMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated QUIC connections.¶

7.1.3.5.Measurement

The following KPI metricsMUST be reported for this benchmarking test:¶

Mandatory KPIs (benchmarks): inspected throughput and application transactions per second¶
Note: The TTLBMUST be reported along with the object size used in the traffic profile.¶
Optional TCP-stack-related KPIs: TCP connections per second, TLS handshake rate, TTFB (minimum, average, and maximum), TTLB (minimum, average, and maximum)¶
Optional QUIC-stack-related KPIs: QUIC connections per second and concurrent QUIC connections¶

7.1.4.Test Procedures and Expected Results

The test procedures are designed to measure the inspected throughput performance of the DUT/SUT at the sustaining period of the traffic load profile. The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (initial throughput) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IP types: IPv4 only, IPv6 only, and IPv4 and IPv6 mixed traffic distribution.¶

7.1.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to generate test traffic at the "initial throughput" rate, as described inSection 7.1.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4. The DUT/SUTMUST reach the "initial throughput" during the sustain phase. Measure all KPIs, as defined inSection 7.1.3.5. The measured KPIs during the sustain phaseMUST meet all the test results validation criteria defined inSection 7.1.3.4.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.1.4.2.Step 2: Test Run with Target Objective

Configure test equipment to generate traffic at the "Target inspected throughput" rate defined inSection 7.1.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4. The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

Within the test results validation criteria, the DUT/SUT is expected to reach the desired value of the target objective ("Target inspected throughput") in the sustain phase. Follow Step 3 if the measured value does not meet the target value or does not fulfill the test results validation criteria.¶

7.1.4.3.Step 3: Test Iteration

Determine the achievable average inspected throughput within the test results validation criteria. The final test iterationMUST be performed for the test duration defined inSection 4.3.4.¶

7.2.TCP Connections Per Second with HTTP Traffic

7.2.1.Objective

Using HTTP traffic, determine the sustainable TCP connection establishment rate supported by the DUT/SUT under different throughput load conditions.¶

To measure connections per second, test iterationsMUST use different fixed HTTP response object sizes (the different load conditions) defined inSection 7.2.3.2.¶

7.2.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.2.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.2.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.2.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target connections per second: Initial value from the product datasheet or the value defined based on the requirement for a specific deployment scenario¶
Initial connections per second: 10% of "Target connections per second"¶
Note: Initial connections per second is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.2.4.¶
TheRECOMMENDED response object sizes are 1, 2, 4, 16, and 64 KB.¶

The clientMUST negotiate HTTP and close the connection with FIN immediately after the completion of one transaction. In each test iteration, the clientMUST send a GET request requesting a fixed HTTP response object size.¶

7.2.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
Concurrent TCP connectionsMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. This confirms the DUT opens and closes TCP connections at approximately the same rate.¶

7.2.3.4.Measurement

TCP connections per secondMUST be reported for each test iteration (for each object size).¶

7.2.4.Test Procedures and Expected Results

The test procedure is designed to measure the DUT/SUT's rate of TCP connections per second during the sustaining period of the traffic load profile. The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (Initial connections per second) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IP types: IPv4 only, IPv6 only, and IPv4 and IPv6 mixed traffic distribution.¶

7.2.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish "Initial connections per second", as defined inSection 7.2.3.2. The traffic load profileMUST be defined as described inSection 4.3.4.¶

The DUT/SUTMUST reach the "Initial connections per second" before the sustain phase. The measured KPIs during the sustain phaseMUST meet all the test results validation criteria defined inSection 7.2.3.3.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT continue to Step 2.¶

7.2.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objective ("Target connections per second") defined inSection 7.2.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4.¶

During the ramp up and sustain phases of each test iteration, other KPIs, such as inspected throughput, concurrent TCP connections, and application transactions per second,MUST NOT reach the maximum value the DUT/SUT can support. The test results for specific test iterationsMUST NOT be reported as valid results if the abovementioned KPI (especially inspected throughput) reaches the maximum value. (For example, if the test iteration with 64 KB of HTTP response object size reached the maximum inspected throughput limitation of the DUT/SUT, the test iterationMAY be interrupted and the result for 64 KB must not be reported.)¶

The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

Within the test results validation criteria, the DUT/SUT is expected to reach the desired value of the target objective ("Target connections per second") in the sustain phase. Follow Step 3 if the measured value does not meet the target value or does not fulfill the test results validation criteria.¶

7.2.4.3.Step 3: Test Iteration

Determine the achievable TCP connections per second within the test results validation criteria.¶

7.3.HTTP Throughput

7.3.1.Objective

Determine the sustainable inspected throughput of the DUT/SUT for HTTP transactions varying the HTTP response object size.¶

7.3.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.3.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.3.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.3.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target inspected throughput: Aggregated line rate of one or more interfaces used in the DUT/SUT or the value defined based on the requirement for a specific deployment scenario¶
Initial throughput: 10% of "Target inspected throughput"¶
Note: Initial throughput is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.3.4.¶
Number of HTTP response object requests (transactions) per connection: 10¶
RECOMMENDED HTTP response object size: 1, 16, 64, and 256 KB and mixed objects defined inTable 5 ¶

Table 5:Mixed Objects
Object size (KB)	Number of requests / Weight
0.2	1
6	1
8	1
9	1
10	1
25	1
26	1
35	1
59	1
347	1

7.3.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the total attempted transactions.¶
TrafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
Concurrent TCP connectionsMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. This confirms the DUT opens and closes TCP connections at approximately the same rate.¶

7.3.3.4.Measurement

Inspected throughput and HTTP transactions per secondMUST be reported for each object size.¶

7.3.4.Test Procedures and Expected Results

The test procedure is designed to measure HTTP throughput of the DUT/ SUT. The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (initial throughput) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IPv4 and IPv6 traffic distributions and HTTP response object sizes.¶

7.3.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish "initial throughput", as defined inSection 7.3.3.2.¶

The traffic load profileMUST be defined as described inSection 4.3.4. The DUT/SUTMUST reach the "initial throughput" during the sustain phase. Measure all KPIs, as defined inSection 7.3.3.4.¶

The measured KPIs during the sustain phaseMUST meet the test results validation criteria "a" defined inSection 7.3.3.3. The test results validation criteria "b" and "c" areOPTIONAL for Step 1.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.3.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objective ("Target inspected throughput") defined inSection 7.3.3.2. The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

Within the test results validation criteria, the DUT/SUT is expected to reach the desired value of the target objective in the sustain phase. Follow Step 3 if the measured value does not meet the target value or does not fulfill the test results validation criteria.¶

7.3.4.3.Step 3: Test Iteration

Determine the achievable inspected throughput within the test results validation criteria and measure the KPI metric transactions per second. The final test iterationMUST be performed for the test duration defined inSection 4.3.4.¶

7.4.HTTP Transaction Latency

7.4.1.Objective

Using HTTP traffic, determine the HTTP transaction latency when the DUT is running with sustainable HTTP transactions per second supported by the DUT/SUT under different HTTP response object sizes.¶

Test iterationsMUST be performed with different HTTP response object sizes in two different scenarios: one with a single transaction and the other with multiple transactions within a single TCP connection. For consistency, both the single and multiple transaction testsMUST be configured with the same HTTP version.¶

Scenario 1: The clientMUST negotiate HTTP and close the connection with FIN immediately after the completion of a single transaction (GET and RESPONSE).¶

Scenario 2: The clientMUST negotiate HTTP and close the connection with FIN immediately after the completion of 10 transactions (GET and RESPONSE) within a single TCP connection.¶

7.4.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.4.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.4.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.4.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target objective for scenario 1: 50% of the connections per second measured in the benchmarking testTCP connections per second with HTTP traffic (Section 7.2)¶
Target objective for scenario 2: 50% of the inspected throughput measured in the benchmarking testHTTP throughput (Section 7.3)¶
Initial objective for scenario 1: 10% of "Target objective for scenario 1"¶
Initial objective for scenario 2: 10% of "Target objective for scenario 2"¶
Note: The initial objectives are not KPIs to report. These values are configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.4.4.¶
HTTP transaction per TCP connection: Test scenario 1 with a single transaction and test scenario 2 with 10 transactions¶
HTTP with GET request requesting a single object: TheRECOMMENDED object sizes are 1, 16, and 64 KB. For each test iteration, the clientMUST request a single HTTP response object size.¶

7.4.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the total attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
Concurrent TCP connectionsMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. This confirms the DUT opens and closes TCP connections at approximately the same rate.¶
After ramp up, the DUTMUST achieve the target objectives defined inSection 7.4.3.2 and remain in that state for the entire test duration (sustain phase).¶

7.4.3.4.Measurement

The TTFB (minimum, average, and maximum) and TTLB (minimum, average, and maximum)MUST be reported for each object size.¶

7.4.4.Test Procedures and Expected Results

The test procedure is designed to measure the TTFB or TTLB when the DUT/SUT is operating close to 50% of its maximum achievable connections per second or inspected throughput. The test procedure consists of two major steps. Step 1 ensures the DUT/SUT is able to reach the initial performance values and meets the test results validation criteria when it was very minimally utilized. Step 2 measures the latency values within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IP types (IPv4 only, IPv6 only, and IPv4 and IPv6 mixed traffic distribution), HTTP response object sizes, and single and multiple transactions per connection scenarios.¶

7.4.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish the initial objectives, as defined inSection 7.4.3.2. The traffic load profileMUST be defined as described inSection 4.3.4.¶

The DUT/SUTMUST reach the initial objectives before the sustain phase. The measured KPIs during the sustain phaseMUST meet all the test results validation criteria defined inSection 7.4.3.3.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.4.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objectives defined inSection 7.4.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4.¶

The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

Within the test results validation criteria, the DUT/SUTMUST reach the desired value of the target objective in the sustain phase.¶

Measure the minimum, average, and maximum values of the TTFB and TTLB.¶

7.5.Concurrent TCP Connection Capacity with HTTP Traffic

7.5.1.Objective

Determine the number of concurrent TCP connections that the DUT/SUT sustains when using HTTP traffic.¶

7.5.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.5.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.5.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.5.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be noted for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target concurrent connection: Initial value from the product datasheet or the value defined based on the requirement for a specific deployment scenario¶
Initial concurrent connection: 10% of "Target concurrent connection"¶
Note: Initial concurrent connection is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.5.4.¶
Maximum connections per second during ramp up phase: 50% of maximum connections per second measured in the benchmarking testTCP connections per second with HTTP traffic (Section 7.2)¶
Ramp up time (in traffic load profile for "Target concurrent connection"): "Target concurrent connection" / "Maximum connections per second during ramp up phase"¶
Ramp up time (in traffic load profile for "Initial concurrent connection"): "Initial concurrent connection" / "Maximum connections per second during ramp up phase"¶

The clientMUST negotiate HTTP, and each clientMAY open multiple concurrent TCP connections per server endpoint IP.¶

Each client sends 10 GET requests requesting 1 KB HTTP response object in the same TCP connection (10 transactions / TCP connections), and the delay (think time) between each transactionMUST be X seconds, where X is as follows.¶

X = ("Ramp up time" + "steady state time") / 10¶

The established connectionsMUST remain open until the ramp down phase of the test. During the ramp down phase, all connectionsMUST be successfully closed with FIN.¶

7.5.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the total attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶

7.5.3.4.Measurement

Average concurrent TCP connectionsMUST be reported for this benchmarking test.¶

7.5.4.Test Procedures and Expected Results

The test procedure is designed to measure the concurrent TCP connection capacity of the DUT/SUT at the sustaining period of the traffic load profile. The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (Initial concurrent connection) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IPv4 and IPv6 traffic distributions.¶

7.5.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure test equipment to establish "Initial concurrent connections" defined inSection 7.5.3.2. Except ramp up time, the traffic load profileMUST be defined as described inSection 4.3.4.¶

During the sustain phase, the DUT/SUTMUST reach the "Initial concurrent connections". The measured KPIs during the sustain phaseMUST meet all the test results validation criteria defined inSection 7.5.3.3.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.5.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objective ("Target concurrent TCP connections"). The test equipmentMUST follow the traffic load profile definition (except ramp up time) as described inSection 4.3.4.¶

During the ramp up and sustain phases, the other KPIs, such as inspected throughput, TCP connections per second, and application transactions per second,MUST NOT reach the maximum value the DUT/SUT can support.¶

The test equipmentMUST start to measure and record KPIs defined inSection 7.5.3.4. Continue the test until all traffic profile phases are completed.¶

7.5.4.3.Step 3: Test Iteration

Determine the achievable concurrent TCP connections capacity within the test results validation criteria.¶

7.6.TCP or QUIC Connections per Second with HTTPS Traffic

7.6.1.Objective

Using HTTPS traffic, determine the sustainable TLS session establishment rate supported by the DUT/SUT under different throughput load conditions.¶

Test iterationsMUST include common cipher suites and key strengths, as well as forward-looking stronger keys. Specific test iterationsMUST include ciphers and keys defined inSection 7.6.3.2.¶

For each cipher suite and key strength, test iterationsMUST use a single HTTPS response object size defined inSection 7.6.3.2 to measure connections per second performance under a variety of DUT/SUT security inspection load conditions.¶

7.6.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.6.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.6.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.6.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target connections per second: Initial value from the product datasheet or the value defined based on the requirement for a specific deployment scenario¶
Initial connections per second: 10% of "Target connections per second"¶
Note: Initial connections per second is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.6.4.)¶
RECOMMENDED ciphers and keys defined inSection 4.3.1.4 ¶
TheRECOMMENDED object sizes are 1, 2, 4, 16, and 64 KB.¶

The clientMUST negotiate HTTPS and close the connection without error immediately after the completion of one transaction. In each test iteration, the clientMUST send a GET request requesting a fixed HTTPS response object size.¶

7.6.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole test duration.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections. If HTTP/3 is used, the number of terminated QUIC connections due to unexpected errorsMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated QUIC connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
The concurrent TCP connections generation rateMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. If HTTP/3 is used, the concurrent QUIC connections generation rateMUST be constant during steady state, and any deviation of concurrent QUIC connectionsMUST be less than 10%. This confirms the DUT opens and closes connections at approximately the same rate.¶

7.6.3.4.Measurement

If HTTP 1.1 or HTTP/2 is used, TCP connections per secondMUST be reported for each test iteration (for each object size).¶

If HTTP/3 is used, QUIC connections per secondMUST be measured and reported for each test iteration (for each object size).¶

The KPI metric TLS handshake rate can be measured in the test using 1 KB object size.¶

7.6.4.Test Procedures and Expected Results

The test procedure is designed to measure the DUT/SUT's rate of TCP or QUIC connections per second during the sustaining period of the traffic load profile. The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (Initial connections per second) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IPv4 and IPv6 traffic distributions.¶

7.6.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish "Initial connections per second", as defined inSection 7.6.3.2. The traffic load profileMUST be defined as described inSection 4.3.4.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.6.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish "Target connections per second", as defined inSection 7.6.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4.¶

During the ramp up and sustain phases, other KPIs, such as inspected throughput, concurrent TCP or QUIC connections, and application transactions per second,MUST NOT reach the maximum value the DUT/SUT can support. The test results for the specific test iterationMUST NOT be reported as valid results if the abovementioned KPI (especially inspected throughput) reaches the maximum value. (For example, if the test iteration with 64 KB of HTTPS response object size reached the maximum inspected throughput limitation of the DUT, the test iterationMAY be interrupted, and the result for 64 KB should not be reported).¶

The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

7.6.4.3.Step 3: Test Iteration

Determine the achievable connections per second within the test results validation criteria.¶

7.7.HTTPS Throughput

7.7.1.Objective

Determine the sustainable inspected throughput of the DUT/SUT for HTTPS transactions by varying the HTTPS response object size.¶

Test iterationsMUST include common cipher suites and key strengths, as well as forward-looking stronger keys. Specific test iterationsMUST include the ciphers and keys defined inSection 7.7.3.2.¶

7.7.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.7.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.7.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.7.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
Target inspected throughput: Aggregated line rate of one or more interfaces used in the DUT/SUT or the value defined based on the requirement for a specific deployment scenario¶
Initial throughput: 10% of "Target inspected throughput"¶
Note: Initial throughput is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.7.4.¶
Number of HTTPS response object requests (transactions) per connection: 10¶
RECOMMENDED ciphers and keys defined inSection 4.3.1.4 ¶
RECOMMENDED HTTPS response object size: 1, 16, 64, and 256 KB and mixed objects defined inTable 5 ofSection 7.3.3.2 ¶

7.7.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the attempted transactions.¶
TrafficMUST be generated at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
The concurrent generated TCP connectionsMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. If HTTP/3 is used, the concurrent generated QUIC connectionsMUST be constant during steady state, and any deviation of concurrent QUIC connectionsMUST be less than 10%. This confirms the DUT opens and closes connections at approximately the same rate.¶

7.7.3.4.Measurement

Inspected throughput and HTTPS transactions per secondMUST be reported for each object size.¶

7.7.4.Test Procedures and Expected Results

The test procedure consists of three major steps. Step 1 ensures the DUT/SUT is able to reach the performance value (initial throughput) and meets the test results validation criteria when it was very minimally utilized. Step 2 determines whether the DUT/SUT is able to reach the target performance value within the test results validation criteria. Step 3 determines the maximum achievable performance value within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IPv4 and IPv6 traffic distributions and HTTPS response object sizes.¶

7.7.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish "initial throughput", as defined inSection 7.7.3.2.¶

The traffic load profileMUST be defined as described inSection 4.3.4. The DUT/SUTMUST reach the "initial throughput" during the sustain phase. Measure all KPIs, as defined inSection 7.7.3.4.¶

The measured KPIs during the sustain phaseMUST meet the test results validation criteria "a" defined inSection 7.7.3.3. The test results validation criteria "b" and "c" areOPTIONAL for Step 1.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.7.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objective ("Target inspected throughput") defined inSection 7.7.3.2. The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

7.7.4.3.Step 3: Test Iteration

Determine the achievable average inspected throughput within the test results validation criteria. The final test iterationMUST be performed for the test duration defined inSection 4.3.4.¶

7.8.HTTPS Transaction Latency

7.8.1.Objective

Using HTTPS traffic, determine the HTTPS transaction latency when the DUT/SUT is running with sustainable HTTPS transactions per second supported by the DUT/SUT under different HTTPS response object sizes.¶

Scenario 1: The clientMUST negotiate HTTPS and close the connection immediately after the completion of a single transaction (GET and RESPONSE).¶

Scenario 2: The clientMUST negotiate HTTPS and close the connection immediately after the completion of 10 transactions (GET and RESPONSE) within a single TCP or QUIC connection.¶

7.8.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.8.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.8.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.8.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
RECOMMENDED cipher suites and key sizes defined inSection 4.3.1.4 ¶
Target objective for scenario 1: 50% of the connections per second measured in the benchmarking testTCP or QUIC connections per second with HTTPS traffic (Section 7.6)¶
Target objective for scenario 2: 50% of the inspected throughput measured in the benchmarking testHTTPS throughput (Section 7.7)¶
Initial objective for scenario 1: 10% of "Target objective for scenario 1"¶
Initial objective for scenario 2: 10% of "Target objective for scenario 2"¶
Note: The initial objectives are not KPIs to report. These values are configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.8.4.¶
HTTPS transaction per TCP or QUIC connection: Test scenario 1 with a single transaction and scenario 2 with 10 transactions¶
HTTPS with GET request requesting a single object: TheRECOMMENDED object sizes are 1, 16, and 64 KB. For each test iteration, the clientMUST request a single HTTPS response object size.¶

7.8.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the total attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RST sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections. If HTTP/3 is used, the number of terminated QUIC connections due to unexpected errorsMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated QUIC connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶
Concurrent TCP or QUIC connectionsMUST be constant during steady state, and any deviation of concurrent TCP connectionsMUST be less than 10%. If HTTP/3 is used, the concurrent generated QUIC connectionsMUST be constant during steady state, and any deviation of concurrent QUIC connectionsMUST be less than 10%. This confirms the DUT opens and closes connections at approximately the same rate.¶
After ramp up, the DUT/SUTMUST achieve the target objectives defined in the parameters inSection 7.8.3.2 and remain in that state for the entire test duration (sustain phase).¶

7.8.3.4.Measurement

The TTFB (minimum, average, and maximum) and TTLB (minimum, average, and maximum)MUST be reported for each object size.¶

7.8.4.Test Procedures and Expected Results

The test procedure is designed to measure the TTFB or TTLB when the DUT/SUT is operating close to 50% of its maximum achievable connections per second or inspected throughput. The test procedure consists of two major steps. Step 1 ensures the DUT/SUT is able to reach the initial performance values and meets the test results validation criteria when it is very minimally utilized. Step 2 measures the latency values within the test results validation criteria.¶

This test procedureMAY be repeated multiple times with different IP types (IPv4 only, IPv6 only, and IPv4 and IPv6 mixed traffic distribution), HTTPS response object sizes, and single and multiple transactions per connection scenarios.¶

7.8.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure the traffic load profile of the test equipment to establish the initial objectives, as defined inSection 7.8.3.2. The traffic load profileMUST be defined as described inSection 4.3.4.¶

The DUT/SUTMUST reach the initial objectives before the sustain phase. The measured KPIs during the sustain phaseMUST meet all the test results validation criteria defined inSection 7.8.3.3.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.8.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objectives defined inSection 7.8.3.2. The test equipmentMUST follow the traffic load profile definition described inSection 4.3.4.¶

The test equipmentMUST start to measure and record all specified KPIs. Continue the test until all traffic profile phases are completed.¶

Within the test results validation criteria, the DUT/SUTMUST reach the desired value of the target objective in the sustain phase.¶

Measure the minimum, average, and maximum values of the TTFB and TTLB.¶

7.9.Concurrent TCP or QUIC Connection Capacity with HTTPS Traffic

7.9.1.Objective

Determine the number of concurrent TCP or QUIC connections the DUT/SUT sustains when using HTTPS traffic.¶

7.9.2.Test Setup

The testbed setupMUST be configured as defined inSection 4. Any specific testbed configuration changes (number of interfaces, interface type, etc.)MUST be documented.¶

7.9.3.Test Parameters

In this section, benchmarking-test-specific parameters are defined.¶

7.9.3.1.DUT/SUT Configuration Parameters

DUT/SUT parametersMUST conform to the requirements defined inSection 4.2. Any configuration changes for this specific benchmarking testMUST be documented.¶

7.9.3.2.Test Equipment Configuration Parameters

Test equipment configuration parametersMUST conform to the requirements defined inSection 4.3. The following parametersMUST be documented for this benchmarking test:¶

Client IP address ranges defined inSection 4.3.1.3 ¶
Server IP address ranges defined inSection 4.3.2.3 ¶
Traffic distribution ratio between IPv4 and IPv6 defined inSection 4.3.1.3 ¶
RECOMMENDED cipher suites and key sizes defined inSection 4.3.1.4 ¶
Target concurrent connections: Initial value from the product datasheet or the value defined based on the requirement for a specific deployment scenario¶
Initial concurrent connections: 10% of "Target concurrent connections"¶
Note: Initial concurrent connections is not a KPI to report. This value is configured on the traffic generator and used to perform Step 1 (Test Initialization and Qualification) described inSection 7.9.4.¶
Connections per second during ramp up phase: 50% of maximum connections per second measured in the benchmarking testTCP or QUIC connections per second with HTTPS traffic (Section 7.6)¶
Ramp up time (in traffic load profile for "Target concurrent connections"): "Target concurrent connections" / "Maximum connections per second during ramp up phase"¶
Ramp up time (in traffic load profile for "Initial concurrent connections"): "Initial concurrent connections" / "Maximum connections per second during ramp up phase"¶

The clientMUST perform HTTPS transactions with persistence, and each client can open multiple concurrent connections per server endpoint IP.¶

Each client sends 10 GET requests requesting 1 KB HTTPS response objects in the same TCP or QUIC connections (10 transactions/connections), and the delay (think time) between each transactionMUST be X seconds, where X is as follows.¶

X = ("Ramp up time" + "steady state time") / 10¶

The established connectionsMUST remain open until the ramp down phase of the test. During the ramp down phase, all connectionsMUST be successfully closed with FIN.¶

7.9.3.3.Test Results Validation Criteria

The following criteria are the test results validation criteria. The test results validation criteriaMUST be monitored during the whole sustain phase of the traffic load profile.¶

The number of failed application transactions (receiving any HTTP response code other than 200 OK)MUST be less than 0.001% (1 out of 100,000 transactions) of the total attempted transactions.¶
The number of terminated TCP connections due to unexpected TCP RSTs sent by the DUT/SUTMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated TCP connections. If HTTP/3 is used, the number of terminated QUIC connections due to unexpected errorsMUST be less than 0.001% (1 out of 100,000 connections) of the total initiated QUIC connections.¶
During the sustain phase, trafficMUST be forwarded at a constant rate (it is considered as a constant rate if any deviation of the traffic forwarding rate is less than 5%).¶

7.9.3.4.Measurement

Average concurrent TCP or QUIC connectionsMUST be reported for this benchmarking test.¶

7.9.4.Test Procedures and Expected Results

This test procedureMAY be repeated multiple times with different IPv4 and IPv6 traffic distributions.¶

7.9.4.1.Step 1: Test Initialization and Qualification

Verify the link status of all connected physical interfaces. All interfaces are expected to be in "UP" status.¶

Configure test equipment to establish "Initial concurrent connections" defined inSection 7.9.3.2. Except ramp up time, the traffic load profileMUST be defined as described inSection 4.3.4.¶

During the sustain phase, the DUT/SUTMUST reach the "Initial concurrent connections". The measured KPIs during the sustain phaseMUST meet the test results validation criteria "a" and "b" defined inSection 7.9.3.3.¶

If the KPI metrics do not meet the test results validation criteria, the test procedureMUST NOT be continued to Step 2.¶

7.9.4.2.Step 2: Test Run with Target Objective

Configure test equipment to establish the target objective ("Target concurrent connections"). The test equipmentMUST follow the traffic load profile definition (except ramp up time) described inSection 4.3.4.¶

During the ramp up and sustain phases, the other KPIs, such as inspected throughput, TCP or QUIC connections per second, and application transactions per second,MUST NOT reach the maximum value that the DUT/SUT can support.¶

The test equipmentMUST start to measure and record KPIs defined inSection 7.9.3.4. Continue the test until all traffic profile phases are completed.¶

7.9.4.3.Step 3: Test Iteration

Determine the achievable concurrent TCP or QUIC connections within the test results validation criteria.¶

Movatterモバイル変換

RFC 9411

Benchmarking Methodology for Network Security Device Performance