RFC 8316: Autonomic Networking Use Case for Distributed Detection of Service Level Agreement (SLA) Violations 中文翻译

URL : https://datatracker.ietf.org/doc/html/rfc8316
标题 : RFC 8316
翻译类型 : 自动生成

Internet Research Task Force (IRTF)                             J. Nobre
Request for Comments: 8316           University of Vale do Rio dos Sinos
Category: Informational                                     L. Granville
ISSN: 2070-1721                  Federal University of Rio Grande do Sul
                                                                A. Clemm
                                                                  Huawei
                                                      A. Gonzalez Prieto
                                                                  VMware
                                                           February 2018

Internet Research Task Force (IRTF)                             J. Nobre
Request for Comments: 8316           University of Vale do Rio dos Sinos
Category: Informational                                     L. Granville
ISSN: 2070-1721                  Federal University of Rio Grande do Sul
                                                                A. Clemm
                                                                  Huawei
                                                      A. Gonzalez Prieto
                                                                  VMware
                                                           February 2018

Autonomic Networking Use Case for Distributed Detection of Service Level Agreement (SLA) Violations

分布式检测服务级别协议（SLA）冲突的自主网络用例

Abstract

摘要

This document describes an experimental use case that employs autonomic networking for the monitoring of Service Level Agreements (SLAs). The use case is for detecting violations of SLAs in a distributed fashion. It strives to optimize and dynamically adapt the autonomic deployment of active measurement probes in a way that maximizes the likelihood of detecting service-level violations with a given resource budget to perform active measurements. This optimization and adaptation should be done without any outside guidance or intervention.

本文档描述了一个实验性用例，该用例使用自主网络来监控服务级别协议（SLA）。该用例用于以分布式方式检测违反SLA的情况。它致力于优化和动态调整主动测量探测的自主部署，以最大限度地提高在给定资源预算下检测服务级别违规的可能性，从而执行主动测量。这种优化和调整应该在没有任何外部指导或干预的情况下进行。

This document is a product of the IRTF Network Management Research Group (NMRG). It is published for informational purposes.

本文件是IRTF网络管理研究组（NMRG）的产品。它是为了提供信息而发布的。

Status of This Memo

关于下段备忘

This document is not an Internet Standards Track specification; it is published for informational purposes.

本文件不是互联网标准跟踪规范；它是为了提供信息而发布的。

This document is a product of the Internet Research Task Force (IRTF). The IRTF publishes the results of Internet-related research and development activities. These results might not be suitable for deployment. This RFC represents the consensus of the Network Management Research Group of the Internet Research Task Force (IRTF). Documents approved for publication by the IRSG are not candidates for any level of Internet Standard; see Section 2 of RFC 7841.

本文件是互联网研究工作组（IRTF）的产品。IRTF发布互联网相关研究和开发活动的结果。这些结果可能不适合部署。本RFC代表了互联网研究任务组（IRTF）网络管理研究小组的共识。IRSG批准发布的文件不适用于任何级别的互联网标准；见RFC 7841第2节。

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at https://www.rfc-editor.org/info/rfc8316.

有关本文件当前状态、任何勘误表以及如何提供反馈的信息，请访问https://www.rfc-editor.org/info/rfc8316.

版权公告

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document.

本文件受BCP 78和IETF信托有关IETF文件的法律规定的约束(https://trustee.ietf.org/license-info)自本文件出版之日起生效。请仔细阅读这些文件，因为它们描述了您对本文件的权利和限制。

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   3
   2.  Definitions and Acronyms  . . . . . . . . . . . . . . . . . .   5
   3.  Current Approaches  . . . . . . . . . . . . . . . . . . . . .   6
   4.  Use Case Description  . . . . . . . . . . . . . . . . . . . .   7
   5.  A Distributed Autonomic Solution  . . . . . . . . . . . . . .   8
   6.  Intended User Experience  . . . . . . . . . . . . . . . . . .  10
   7.  Implementation Considerations . . . . . . . . . . . . . . . .  11
     7.1.  Device-Based Self-Knowledge and Decisions . . . . . . . .  11
     7.2.  Interaction with Other Devices  . . . . . . . . . . . . .  11
   8.  Comparison with Current Solutions . . . . . . . . . . . . . .  12
   9.  Related IETF Work . . . . . . . . . . . . . . . . . . . . . .  12
   10. IANA Considerations . . . . . . . . . . . . . . . . . . . . .  13
   11. Security Considerations . . . . . . . . . . . . . . . . . . .  13
   12. Informative References  . . . . . . . . . . . . . . . . . . .  13
   Acknowledgements  . . . . . . . . . . . . . . . . . . . . . . . .  16
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  16

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   3
   2.  Definitions and Acronyms  . . . . . . . . . . . . . . . . . .   5
   3.  Current Approaches  . . . . . . . . . . . . . . . . . . . . .   6
   4.  Use Case Description  . . . . . . . . . . . . . . . . . . . .   7
   5.  A Distributed Autonomic Solution  . . . . . . . . . . . . . .   8
   6.  Intended User Experience  . . . . . . . . . . . . . . . . . .  10
   7.  Implementation Considerations . . . . . . . . . . . . . . . .  11
     7.1.  Device-Based Self-Knowledge and Decisions . . . . . . . .  11
     7.2.  Interaction with Other Devices  . . . . . . . . . . . . .  11
   8.  Comparison with Current Solutions . . . . . . . . . . . . . .  12
   9.  Related IETF Work . . . . . . . . . . . . . . . . . . . . . .  12
   10. IANA Considerations . . . . . . . . . . . . . . . . . . . . .  13
   11. Security Considerations . . . . . . . . . . . . . . . . . . .  13
   12. Informative References  . . . . . . . . . . . . . . . . . . .  13
   Acknowledgements  . . . . . . . . . . . . . . . . . . . . . . . .  16
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  16

1. Introduction

1. 介绍

The Internet has been growing dramatically in terms of size, capacity, and accessibility in recent years. Communication requirements of distributed services and applications running on top of the Internet have become increasingly demanding. Some examples are real-time interactive video or financial trading. Providing such services involves stringent requirements in terms of acceptable latency, loss, and jitter.

近年来，互联网在规模、容量和可访问性方面都在急剧增长。在Internet上运行的分布式服务和应用程序的通信需求越来越高。例如实时交互式视频或金融交易。提供此类服务需要严格的延迟、丢失和抖动要求。

Performance requirements lead to the articulation of Service Level Objectives (SLOs) that must be met. Those SLOs are part of Service Level Agreements (SLAs) that define a contract between the provider and the consumer of a service. SLOs, in effect, constitute a service-level guarantee that the consumer of the service can expect to receive (and often has to pay for). Likewise, the provider of a service needs to ensure that the service-level guarantee and associated SLOs are met. Some examples of clauses that relate to SLOs can be found in [RFC7297].

性能要求导致必须满足的服务级别目标（SLO）的明确化。这些SLO是服务级别协议（SLA）的一部分，SLA定义了服务提供者和使用者之间的合同。实际上，SLO构成了一种服务级别保证，服务的消费者可以期望得到（并且通常需要付费）。同样，服务提供商需要确保满足服务级别保证和相关SLO。[RFC7297]中有一些与SLO相关的子句示例。

Violations of SLOs can be associated with significant financial loss, which can by divided into two categories. First, there is the loss that can be incurred by the user of a service when the agreed service levels are not provided. For example, a financial brokerage's stock orders might suffer losses when it is unable to execute stock transactions in a timely manner. An electronic retailer may lose customers when its online presence is perceived by customers as sluggish. An online gaming provider may not be able to provide fair access to online players, resulting in frustrated players who are lost as customers. In each case, the failure of a service provider to meet promised service-level guarantees can have a substantial financial impact on users of the service. Second, there is the loss that is incurred by the provider of a service who is unable to meet promised SLOs. Those losses can take several forms, such as penalties for violating the service level agreement and even loss of future revenue due to reduced customer satisfaction (which, in many cases, is more serious). Hence, SLOs are a key concern for the service provider. In order to ensure that SLOs are not being violated, service levels need to be continuously monitored at the network infrastructure layer in order to know, for example, when mitigating actions need to be taken. To that end, service-level measurements must take place.

违反SLO可导致重大财务损失，可分为两类。首先，当没有提供约定的服务级别时，服务的用户可能会遭受损失。例如，当金融经纪公司无法及时执行股票交易时，其股票订单可能会遭受损失。当消费者认为电子零售商的在线状态迟钝时，它可能会失去顾客。在线游戏提供商可能无法为在线玩家提供公平的访问权限，从而导致沮丧的玩家失去客户。在每种情况下，服务提供商未能达到承诺的服务级别保证都会对服务用户产生重大的财务影响。第二，服务提供商因无法满足承诺的SLO而遭受的损失。这些损失可以采取多种形式，例如违反服务级别协议的处罚，甚至由于客户满意度降低（在许多情况下，更为严重）而导致的未来收入损失。因此，SLO是服务提供商关注的一个关键问题。为了确保不违反SLO，需要在网络基础设施层持续监控服务级别，以便知道（例如）何时需要采取缓解措施。为此，必须进行服务级别测量。

Network measurements can be performed using active or passive measurement techniques. In passive measurements, production traffic is observed, and no monitoring traffic is created by the measurement process itself. That is, network conditions are checked in a non-intrusive way. In the context of IP Flow Information Export

可以使用主动或被动测量技术执行网络测量。在被动测量中，观察生产流量，测量过程本身不会创建监控流量。也就是说，以非侵入方式检查网络状况。在IP流信息导出的上下文中

(IPFIX), several documents were produced that define how to export data associated with flow records, i.e., data that is collected as part of passive measurement mechanisms, generally applied against flows of production traffic (e.g., [RFC7011]). In addition, it is possible to collect real data traffic (not just summarized flow records) with time-stamped packets, possibly sampled (e.g., per [RFC5474]), as a means of measuring and inferring service levels. Active measurements, on the other hand, are more intrusive to the network in the sense that they involve injecting synthetic test traffic into the network to measure network service levels, as opposed to simply observing production traffic. The IP Performance Metrics (IPPM) Working Group produced documents that describe active measurement mechanisms such as the One-Way Active Measurement Protocol (OWAMP) [RFC4656], the Two-Way Active Measurement Protocol (TWAMP) [RFC5357], and the Cisco Service-Level Assurance Protocol [RFC6812]. In addition, there are some mechanisms that do not cleanly fit into either active or passive categories, such as Performance and Diagnostic Metrics (PDM) Destination Option techniques [RFC8250].

（IPFIX），产生了一些文件，定义了如何导出与流量记录相关的数据，即作为被动测量机制的一部分收集的数据，通常用于生产流量（例如，[RFC7011]）。此外，作为测量和推断服务水平的一种手段，可以使用可能采样的时间戳数据包（例如，根据[RFC5474]）收集真实数据流量（而不仅仅是汇总的流量记录）。另一方面，主动测量对网络的干扰更大，因为它们涉及将合成测试流量注入网络以测量网络服务级别，而不是简单地观察生产流量。IP性能度量（IPPM）工作组编制了描述主动测量机制的文件，如单向主动测量协议（OWAMP）[RFC4656]、双向主动测量协议（TWAMP）[RFC5357]和Cisco服务水平保证协议[RFC6812]。此外，还有一些机制不能完全归入主动或被动类别，例如性能和诊断指标（PDM）目标选项技术[RFC8250]。

Active measurement mechanisms offer a high level of control over what and how to measure. They do not require inspecting production traffic. Because of this, active measurements usually offer better accuracy and privacy than passive measurement mechanisms. Traffic encryption and regulations that limit the amount of payload inspection that can occur are non-issues. Furthermore, active measurement mechanisms are able to detect end-to-end network performance problems in a fine-grained way (e.g., simulating the traffic that must be handled considering specific SLOs). As a result, active measurements are often preferred over passive measurement for SLA monitoring. Measurement probes must be hosted in network devices and measurement sessions must be activated to compute the current network metrics (for example, metrics such as the ones described in [RFC4148], although note that [RFC4148] was obsoleted by [RFC6248]). This activation should be dynamic in order to follow changes in network conditions, such as those related to routes being added or new customer demands.

主动测量机制提供了对测量内容和方式的高度控制。它们不需要检查生产流量。因此，主动测量通常比被动测量机制提供更好的准确性和隐私性。流量加密和限制可能发生的有效负载检查量的法规不是问题。此外，主动测量机制能够以细粒度的方式检测端到端网络性能问题（例如，模拟必须考虑特定SLO处理的流量）。因此，对于SLA监控，主动测量通常优于被动测量。测量探针必须托管在网络设备中，并且必须激活测量会话以计算当前网络度量（例如，[RFC4148]中描述的度量，但请注意，[RFC4148]已被[RFC6248]淘汰）。此激活应该是动态的，以便跟踪网络条件的变化，例如与添加的路线或新客户需求相关的变化。

While offering many advantages, active measurements are expensive in terms of network resource consumption. Active measurements generally involve measurement probes that generate synthetic test traffic that is directed at a responder. The responder needs to timestamp test traffic it receives and reflect it back to the originating measurement probe. The measurement probe subsequently processes the returned packets along with time-stamping information in order to compute service levels. Accordingly, active measurements consume substantial CPU cycles as well as memory of network devices to

虽然主动测量具有许多优点，但就网络资源消耗而言，其成本很高。主动测量通常涉及产生指向响应者的合成测试流量的测量探头。响应者需要给接收到的测试流量加时间戳，并将其反射回原始测量探针。测量探针随后处理返回的数据包以及时间戳信息，以便计算服务级别。因此，主动测量会消耗大量的CPU周期以及网络设备的内存来进行测量

generate and process test traffic. In addition, synthetic traffic increases network load. Thus, active measurements compete for resources with other functions, including routing and switching.

生成和处理测试流量。此外，合成流量增加了网络负载。因此，主动测量与其他功能竞争资源，包括路由和交换。

The resources required and traffic generated by the active measurement sessions are, in a large part, a function of the number of measured network destinations. (In addition, the amount of traffic generated for each measurement plays a role that, in turn, influences the accuracy of the measurement.) When more destinations are measured, a greater number of resources are consumed and more traffic is needed to perform the measurements. Thus, to have better monitoring coverage, it is necessary to deploy more sessions, which consequently increases consumed resources. Otherwise, enabling the observation of just a small subset of all network flows can lead to insufficient coverage.

主动测量会话所需的资源和产生的流量在很大程度上是测量网络目的地数量的函数。（此外，每次测量产生的通信量起着作用，进而影响测量的准确性。）当测量更多目的地时，消耗更多的资源，需要更多的通信量来执行测量。因此，为了获得更好的监控覆盖率，有必要部署更多的会话，从而增加所消耗的资源。否则，仅对所有网络流的一小部分进行观察可能会导致覆盖不足。

Furthermore, while some end-to-end service levels can be determined by adding up the service levels observed across different path segments, the same is not true for all service levels. For example, the end-to-end delay or packet loss from a node A to a node C routed via a node B can often be computed simply by adding delays (or loss) from A to B and from B to C. This allows the decomposition of a large set of end-to-end measurements into a much smaller set of segment measurements. However, end-to-end jitter and mean opinion scores cannot be decomposed as easily and, for higher accuracy, must be measured end-to-end.

此外，虽然某些端到端服务级别可以通过将不同路径段上观察到的服务级别相加来确定，但并非所有服务级别都是如此。例如，从节点a到节点C经由节点B路由的端到端延迟或分组丢失通常可以简单地通过将从a到B和从B到C的延迟（或丢失）相加来计算。这允许将大量端到端测量分解为更小的段测量集。然而，端到端的抖动和平均意见分数不能很容易地分解，为了获得更高的准确度，必须端到端测量。

Hence, the decision about how to place measurement probes becomes an important management activity. The goal is to obtain the maximum benefits of service-level monitoring with a limited amount of measurement overhead. Specifically, the goal is to maximize the number of service-level violations that are detected with a limited number of resources.

因此，如何放置测量探针的决策成为一项重要的管理活动。目标是在有限的测量开销下获得服务级别监视的最大好处。具体来说，目标是最大限度地利用有限的资源检测到违反服务级别的情况。

The use case and the solution approach described in this document address an important practical issue. They are intended to provide a basis for further experimentation to lead to solutions for wider deployment. This document represents the consensus of the IRTF's Network Management Research Group (NMRG). It was discussed extensively and received three separate in-depth reviews.

本文档中描述的用例和解决方案方法解决了一个重要的实际问题。它们旨在为进一步的试验提供基础，从而为更广泛的部署提供解决方案。本文件代表了IRTF网络管理研究小组（NMRG）的共识。会议进行了广泛讨论，并收到了三次单独的深入审查。

2. Definitions and Acronyms

2. 定义和首字母缩略词

Active Measurements: Techniques to measure service levels that involve generating and observing synthetic test traffic

主动测量：测量服务级别的技术，包括生成和观察合成测试流量

Passive Measurements: Techniques used to measure service levels based on observation of production traffic

被动测量：基于对生产流量的观察来测量服务水平的技术

Autonomic Network: A network containing exclusively autonomic nodes, requiring no configuration, and deriving all required information through self-knowledge, discovery, or intent.

自主网络：一种包含独占自主节点的网络，不需要配置，通过自我了解、发现或意图获取所有必需的信息。

Autonomic Service Agent (ASA): An agent implemented on an autonomic node that implements an autonomic function, either in part (in the case of a distributed function, as in the context of this document) or whole

自主服务代理（ASA）：在自主节点上实现的代理，可部分实现自主功能（在分布式功能的情况下，如本文档中所述）或全部实现自主功能

Measurement Session: A communications association between a probe and a responder used to send and reflect synthetic test traffic for active measurements

测量会话：探测器和应答器之间的通信关联，用于发送和反映活动测量的合成测试流量

Probe: The source of synthetic test traffic in an active measurement

探测：主动测量中合成测试流量的来源

Responder: The destination for synthetic test traffic in an active measurement

响应者：活动测量中合成测试流量的目的地

SLA: Service Level Agreement

SLA：服务级别协议

SLO: Service Level Objective

SLO：服务水平目标

P2P: Peer-to-Peer

P2P：点对点

(Note: The definitions for "Autonomic Network" and "Autonomic Service Agent" are borrowed from [RFC7575]).

（注：“自主网络”和“自主服务代理”的定义借用自[RFC7575]）。

3. Current Approaches

3. 当前方法

For feasible deployments of active measurement solutions to distribute the available measurement sessions along the network, the current best practice consists of relying entirely on the human administrator's expertise to infer the best location to activate such sessions. This is done through several steps. First, it is necessary to collect traffic information in order to grasp the traffic matrix. Then, the administrator uses this information to infer the best destinations for measurement sessions. After that, the administrator activates sessions on the chosen subset of destinations, taking the available resources into account. This practice, however, does not scale well because it is still labor intensive and error-prone for the administrator to determine which sessions should be activated given the set of critical flows that needs to be measured. Even worse, this practice completely fails in networks where the most critical flows change rapidly, resulting in dynamic changes to what would be the most important destinations. For example, this can be the case in modern cloud environments. This is because fast reactions are necessary to reconfigure the sessions, and administrators are just not quick enough in computing and

对于主动测量解决方案的可行部署，以沿网络分布可用的测量会话，当前的最佳实践包括完全依赖人工管理员的专业知识来推断激活此类会话的最佳位置。这是通过几个步骤完成的。首先，为了掌握交通矩阵，有必要收集交通信息。然后，管理员使用此信息推断度量会话的最佳目的地。之后，管理员在考虑可用资源的情况下，在选定的目标子集上激活会话。但是，这种做法不能很好地扩展，因为考虑到需要度量的一组关键流，管理员仍然需要耗费大量人力，并且很容易出错，从而确定应该激活哪些会话。更糟糕的是，这种做法在网络中完全失败，因为网络中最关键的流量变化很快，导致最重要的目的地发生动态变化。例如，在现代云环境中可能就是这样。这是因为重新配置会话需要快速反应，而管理员在计算和管理方面不够快

activating the new set of required sessions every time the network traffic pattern changes. Finally, the current practice for active measurements usually covers only a fraction of the network flows that should be observed, which invariably leads to the damaging consequence of undetected SLA violations.

每次网络流量模式更改时激活新的所需会话集。最后，当前的主动测量实践通常只涵盖应观察的网络流的一小部分，这必然导致未检测到的SLA违反的破坏性后果。

4. Use Case Description

4. 用例描述

The use case involves a service-level provider that needs to monitor the network to detect service-level violations using active service-level measurements and wants to be able to do so with minimal human intervention. The goal is to conduct the measurements in an effective manner to maximize the percentage of detected service-level violations. The service-level provider has a bounded resource budget with regard to measurements that can be performed, specifically the number of measurements that can be conducted concurrently from any one network device and possibly the total amount of measurement traffic on the network. However, while at any one point in time the number of measurements conducted is limited, it is possible for a device to change which destinations to measure over time. This can be exploited to achieve a balance of eventually covering all possible destinations using a reasonable amount of "sampling" where measurement coverage of a destination cannot be continuous. The solution needs to be dynamic and able to cope with network conditions that may change over time. The solution should also be embeddable inside network devices that control the deployment of active measurement mechanisms.

该用例涉及一个服务级别提供商，该提供商需要监控网络，以使用活动服务级别度量来检测服务级别违规行为，并希望能够以最少的人为干预来做到这一点。目标是以有效的方式进行度量，以最大限度地提高检测到的服务级别违规的百分比。服务级别提供商具有关于可执行的测量的有界资源预算，具体地说，可从任何一个网络设备同时执行的测量的数量以及可能的网络上的测量流量的总量。然而，尽管在任何一个时间点进行的测量的数量是有限的，但是设备可以随时间改变要测量的目的地。在目的地的测量覆盖范围不能连续的情况下，可以利用这一点，使用合理数量的“抽样”来实现最终覆盖所有可能目的地的平衡。解决方案需要是动态的，并且能够应对可能随时间变化的网络条件。该解决方案还应可嵌入控制主动测量机制部署的网络设备中。

The goal is to conduct the measurements in a smart manner that ensures that the network is broadly covered and that the likelihood of detecting service-level violations is maximized. In order to maximize that likelihood, it is reasonable to focus measurement resources on destinations that are more likely to incur a violation, while spending fewer resources on destinations that are more likely to be in compliance. In order to do this, there are various aspects that can be exploited, including past measurements (destinations close to a service-level threshold requiring more focus than destinations farther from it), complementation with passive measurements such as flow data (to identify network destinations that are currently popular and critical), and observations from other parts of the network. In addition, measurements can be coordinated among different network devices to avoid hitting the same destination at the same time and to share results that may be useful in future probe placement.

目标是以智能的方式进行测量，以确保网络被广泛覆盖，并使检测到服务级别违规的可能性最大化。为了最大限度地提高这种可能性，合理的做法是将度量资源集中在更有可能发生违规的目的地，同时在更有可能遵守法规的目的地上花费更少的资源。为了做到这一点，可以利用多个方面，包括过去的测量（接近服务级别阈值的目的地需要比远离服务级别阈值的目的地更多的关注），补充被动测量，如流量数据（以识别当前流行且关键的网络目的地），以及来自网络其他部分的观察。此外，可以在不同的网络设备之间协调测量，以避免在同一时间击中同一目的地，并共享可能在未来探头放置中有用的结果。

Clearly, static solutions will have severe limitations. At the same time, human administrators cannot be in the loop for continuous dynamic reconfigurations of measurement probes. Thus, an automated

显然，静态解决方案将具有严重的局限性。同时，人力管理员不能参与测量探针的连续动态重新配置。因此，自动化

solution, or ideally an autonomic solution, is needed so that network measurements are automatically orchestrated and dynamically reconfigured from within the network. This can be accomplished using an autonomic solution that is distributed, using ASAs that are implemented on nodes in the network.

需要一个解决方案，或者理想情况下是一个自主解决方案，以便在网络内自动协调和动态重新配置网络测量。这可以通过使用在网络节点上实现的ASA，使用分布式的自主解决方案来实现。

5. A Distributed Autonomic Solution

5. 分布式自主解决方案

The use of Autonomic Networking (AN) [RFC7575] can help such detection through an efficient activation of measurement sessions. Such an approach, along with a detailed assessment confirming its viability, is described in [P2PBNM-Nobre-2012]. The problem to be solved by AN in the present use case is how to steer the process of measurement session activation by a complete solution that sets all necessary parameters for this activation to operate efficiently, reliably, and securely, with no required human intervention other than setting overall policy.

使用自主网络（AN）[RFC7575]可以通过有效激活测量会话来帮助进行此类检测。[P2PBNM-Nobre-2012]中描述了该方法以及确认其可行性的详细评估。在本用例中，AN需要解决的问题是，如何通过一个完整的解决方案来引导度量会话激活过程，该解决方案设置了该激活的所有必要参数，以高效、可靠和安全地运行，除了设置总体策略外，无需人工干预。

When a node first comes online, it has no information about which measurements are more critical than others. In the absence of information about past measurements and information from measurement peers, it may start with an initial set of measurement sessions, possibly randomly seeding a set of starter measurements and perhaps taking a round-robin approach for subsequent measurement rounds. However, as measurements are collected, a node will gain an increasing amount of information that it can utilize to refine its strategy of selecting measurement targets going forward. For one, it may take note of which targets returned measurement results very close to service-level thresholds; these targets may require closer scrutiny compared to others. Second, it may utilize observations that are made by its measurement peers in order to conclude which measurement targets may be more critical than others and to ensure that proper overall measurement coverage is obtained (so that not every node incidentally measures the same targets, while other targets are not measured at all).

当一个节点首次联机时，它没有关于哪些度量比其他度量更重要的信息。在缺乏关于过去测量的信息和来自测量同行的信息的情况下，它可以从一组初始测量会话开始，可能随机播种一组初始测量，并可能对后续测量轮采取循环方法。然而，随着测量数据的收集，节点将获得越来越多的信息，可以利用这些信息改进其选择测量目标的策略。首先，它可能会注意到哪些目标返回的测量结果非常接近服务级别阈值；与其他目标相比，这些目标可能需要更仔细的审查。其次，它可以利用其测量对等点的观察结果，以得出哪些测量目标可能比其他测量目标更关键，并确保获得适当的总体测量覆盖率（因此，并非每个节点偶然测量相同的目标，而其他目标则根本不测量）。

We advocate for embedding P2P technology in network devices in order to use autonomic control loops to make decisions about measurement sessions.

我们主张在网络设备中嵌入P2P技术，以便使用自主控制循环来决定测量会话。

Specifically, we advocate for network devices to implement an autonomic function that monitors service levels for violations of SLOs and that determines which measurement sessions to set up at any given point in time based on current and past observations of the node and of other peer nodes.

具体而言，我们主张网络设备实现一种自主功能，该功能可监控服务级别是否违反SLO，并根据节点和其他对等节点的当前和过去观察结果确定在任何给定时间点设置哪些测量会话。

By performing these functions locally and autonomically on the device itself, which measurements to conduct can be modified quickly based

通过在设备本身上本地和自主地执行这些功能，可以根据需要快速修改要进行的测量

on local observations while taking local resource availability into account. This allows a solution to be more robust and react more dynamically to rapidly changing service levels than a solution that has to rely on central coordination. However, in order to optimize decisions about which measurements to conduct, a node will need to communicate with other nodes. This allows a node to take into account other nodes' observations in addition to its own in its decisions.

在考虑当地资源可用性的同时，进行当地观测。这使得解决方案比依赖于集中协调的解决方案更健壮，对快速变化的服务级别做出更动态的反应。然而，为了优化关于进行哪些测量的决策，节点将需要与其他节点通信。这允许一个节点在其决策中除了考虑自己的观察之外，还考虑其他节点的观察。

For example, remote destinations whose observed service levels are on the verge of violating stated objectives may require closer monitoring than remote destinations that are comfortably within a range of tolerance. A distributed autonomic solution also allows nodes to coordinate their probing decisions to collectively achieve the best possible measurement coverage. Because the number of resources available for monitoring, exchanging measurement data, and coordinating with other nodes is limited, a node may be interested in identifying other nodes whose observations are similar to and correlated with its own. This helps a node prioritize and decide which other nodes to coordinate and exchange data with. All of this requires the use of a P2P overlay.

例如，与舒适地处于公差范围内的远程目的地相比，其观察到的服务级别接近于违反所述目标的远程目的地可能需要更密切的监控。分布式自主解决方案还允许节点协调其探测决策，共同实现最佳的测量覆盖率。由于可用于监视、交换测量数据以及与其他节点协调的资源的数量是有限的，因此节点可能有兴趣识别其观测值与其自身相似且相关的其他节点。这有助于节点排定优先级并决定与哪些其他节点协调和交换数据。所有这些都需要使用P2P覆盖。

A P2P overlay is essential for several reasons:

P2P覆盖非常重要，原因如下：

o It makes it possible for nodes (or more specifically, the ASAs that are deployed on those nodes) in the network to autonomically set up measurement sessions without having to rely on a central management system or controller to perform configuration operations associated with configuring measurement probes and responders.

o 它使得网络中的节点（或更具体地说，部署在这些节点上的ASA）能够自主地建立测量会话，而不必依赖中央管理系统或控制器来执行与配置测量探头和响应器相关的配置操作。

o It facilitates the exchange of data between different nodes to share measurement results so that each node can refine its measurement strategy based not just on its own observations, but also on observations from its peers.

o 它有助于不同节点之间的数据交换，以共享测量结果，这样每个节点不仅可以根据自己的观察结果，还可以根据来自对等节点的观察结果来改进其测量策略。

o It allows nodes to coordinate their measurements to obtain the best possible test coverage and avoid measurements that have a very low likelihood of detecting service-level violations.

o 它允许节点协调它们的测量，以获得最佳的测试覆盖率，并避免检测到服务级别违规的可能性非常低的测量。

The provisioning of the P2P overlay should be transparent for the network administrator. An Autonomic Control Plane such as defined in [ACP] provides an ideal candidate for the P2P overlay to run on.

P2P覆盖的设置对于网络管理员来说应该是透明的。[ACP]中定义的自主控制平面为P2P覆盖提供了一个理想的候选对象。

An autonomic solution for the distributed detection of SLA violations provides several benefits. First, it provides efficiency; this solution should optimize the resource consumption and avoid resource starvation on the network devices. A device that is "self-aware" of

用于分布式检测SLA冲突的自主解决方案提供了几个好处。首先，它提供了效率；此解决方案应优化资源消耗，并避免网络设备上的资源不足。一种“自我感知”的设备

its available resources will be able to adjust measurement activities rapidly as needed, without requiring a separate control loop involving resource monitoring by an external system. Second, placing logic about where to conduct measurements into the node enables rapid control loops that allow devices to react instantly to observations and adjust their measurement strategy. For example, a device could decide to adjust the amount of synthetic test traffic being sent during the measurement itself depending on results observed so far on this and other concurrent measurement sessions. As a result, the solution could decrease the time necessary to detect SLA violations. Adaptivity features of an autonomic loop could capture the network dynamics faster than a human administrator or even a central controller. Finally, the solution could help to reduce the workload of human administrators.

其可用资源将能够根据需要快速调整测量活动，而不需要外部系统进行涉及资源监控的单独控制回路。其次，将有关在何处进行测量的逻辑放置到节点中可以实现快速控制回路，从而允许设备对观察结果立即作出反应并调整其测量策略。例如，设备可以决定根据迄今为止在该和其他并发测量会话上观察到的结果来调整在测量期间发送的合成测试通信量。因此，该解决方案可以减少检测SLA冲突所需的时间。自主循环的自适应特性可以比人类管理员甚至中央控制器更快地捕获网络动态。最后，该解决方案有助于减少人工管理员的工作量。

In practice, these factors combine to maximize the likelihood of SLA violations being detected while operating within a given resource budget, allowing a continuous measurement strategy that takes into account past measurement results to be conducted, observations of other measures such as link utilization or flow data, measurement results shared between network devices, and future measurement activities coordinated among nodes. Combined, this can result in efficient measurement decisions that achieve a golden balance between offering broad network coverage and honing in on service-level "hot spots".

在实践中，这些因素结合起来可最大限度地提高在给定资源预算范围内运行时检测到SLA违规的可能性，从而允许持续测量策略，该策略考虑到要执行的过去测量结果、对其他测量（如链路利用率或流量数据）的观察，网络设备之间共享测量结果，节点之间协调未来的测量活动。综合起来，这可以产生有效的度量决策，在提供广泛的网络覆盖和磨练服务级别的“热点”之间实现黄金平衡。

6. Intended User Experience

6. 预期用户体验

The autonomic solution should not require any human intervention in the distributed detection of SLA violations. By virtue of the solution being autonomic, human users will not have to plan which measurements to conduct in a network, which is often a very labor-intensive task that requires detailed analysis of traffic matrices and network topologies and is not prone to easy dynamic adjustment. Likewise, they will not have to configure measurement probes and responders.

自主解决方案在SLA违规的分布式检测中不需要任何人工干预。由于该解决方案是自主的，人类用户将不必计划在网络中进行哪些测量，这通常是一项劳动密集型任务，需要对流量矩阵和网络拓扑进行详细分析，并且不容易进行轻松的动态调整。同样，他们不必配置测量探头和响应器。

There are some ways in which a human administrator may still interact with the solution. First, the human administrator will, of course, be notified and obtain reports about service-level violations that are observed. Second, a human administrator may set policies regarding how closely to monitor the network for service-level violations and how many resources to spend. For example, an administrator may set a resource budget that is assigned to network devices for measurement operations. With that given budget, the number of SLO violations that are detected will be maximized. Alternatively, an administrator may set a target for the percentage of SLO violations that must be detected, i.e., a target for the ratio

人工管理员仍可以通过某些方式与解决方案进行交互。首先，当然，人工管理员将收到通知，并获得有关所观察到的服务级别违规的报告。第二，人工管理员可以设置有关如何密切监视网络的服务级别冲突以及需要花费多少资源的策略。例如，管理员可以设置分配给网络设备用于测量操作的资源预算。在给定预算的情况下，检测到的SLO违规数量将最大化。或者，管理员可以为必须检测到的SLO违规百分比设置目标，即比率目标

between the number of detected SLO violations and the number of total SLO violations that are actually occurring (some of which might go undetected). In that case, the solution will aim to minimize the resources spent (i.e., the amount of test traffic and number of measurement sessions) that are required to achieve that target.

在检测到的SLO违规数量和实际发生的SLO违规总数之间（其中一些可能未被检测到）。在这种情况下，解决方案的目标是最小化实现该目标所需的资源（即测试通信量和测量会话数）。

7. Implementation Considerations

7. 实施考虑

The active measurement model assumes that a typical infrastructure will have multiple network segments, multiple Autonomous Systems (ASes), and a reasonably large number of routers. It also considers that multiple SLOs can be in place at a given time. Since interoperability in a heterogeneous network is a goal, features found on different active measurement mechanisms (e.g., OWAMP, TWAMP, and Cisco Service Level Assurance Protocol) and device programmability interfaces (such as Juniper's Junos API or Cisco's Embedded Event Manager) could be used for the implementation. The autonomic solution should include and/or reference specific algorithms, protocols, metrics, and technologies for the implementation of distributed detection of SLA violations as a whole.

主动测量模型假设一个典型的基础设施将有多个网段、多个自治系统（ASE）和相当多的路由器。它还考虑到在给定的时间内可以有多个SLO。由于异构网络中的互操作性是一个目标，不同主动测量机制（如OWAMP、TWAMP和Cisco服务级别保证协议）和设备可编程接口（如Juniper的Junos API或Cisco的嵌入式事件管理器）上的功能可用于实现。自主解决方案应包括和/或参考特定的算法、协议、度量和技术，以实现对SLA违规行为的分布式检测。

Finally, it should be noted that there are multiple deployment scenarios, including deployment scenarios that involve physical devices hosting autonomic functions or virtualized infrastructure hosting the same. Co-deployment in conjunction with Virtual Network Functions (VNFs) is a possibility for further study.

最后，应该注意的是，存在多种部署场景，包括涉及托管自主功能的物理设备或托管自主功能的虚拟化基础设施的部署场景。联合部署与虚拟网络功能（VNF）是进一步研究的可能性。

7.1. Device-Based Self-Knowledge and Decisions

7.1. 基于设备的自我认识和决策

Each device has self-knowledge about the local SLA monitoring. This could be in the form of historical measurement data and SLOs. Besides that, the devices would have algorithms that could decide which probes should be activated at a given time. The choice of which algorithm is better for a specific situation would be also autonomic.

每个设备都可以自行了解本地SLA监控。这可以是历史测量数据和SLO的形式。除此之外，这些设备将具有算法，可以决定在给定时间应激活哪些探测器。选择哪种算法更适合于特定情况也是自主的。

7.2. Interaction with Other Devices

7.2. 与其他设备的交互

Network devices should share information about service-level measurement results. This information can speed up the detection of SLA violations and increase the number of detected SLA violations. For example, if one device detects that a remote destination is in danger of violating an SLO, other devices may conduct additional measurements to the same destination or other destinations in its proximity. For any given network device, the exchange of data may be more important with some devices (for example, devices in the same network neighborhood or devices that are "correlated" by some other means) than with others. Defining the network devices that exchange

网络设备应共享有关服务级别测量结果的信息。此信息可以加快SLA违规检测的速度，并增加检测到的SLA违规数量。例如，如果一个设备检测到远程目的地有违反SLO的危险，则其他设备可以对同一目的地或其附近的其他目的地进行额外测量。对于任何给定的网络设备，与某些设备（例如，同一网络邻居中的设备或通过某些其他方式“相关”的设备）的数据交换可能比与其他设备的数据交换更重要。定义用于交换数据的网络设备

measurement data (i.e., management peers) creates a new topology. Different approaches could be used to define this topology (e.g., correlated peers [P2PBNM-Nobre-2012]). To bootstrap peer selection, each device should use its known neighbors (e.g., FIB and RIB tables) as initial seeds to identify possible peers. It should be noted that a solution will benefit if topology information and network discovery functions are provided by the underlying autonomic framework. A solution will need to be able to discover measurement peers as well as measurement targets, specifically measurement targets that support active measurement responders and that will be able to respond to measurement requests and reflect measurement traffic as needed.

测量数据（即管理对等点）创建新拓扑。可以使用不同的方法来定义该拓扑（例如，相关对等方[P2PBNM-Nobre-2012]）。为了引导对等选择，每个设备都应该使用其已知的邻居（例如FIB和RIB表）作为初始种子来识别可能的对等点。应该注意的是，如果拓扑信息和网络发现功能由底层自治框架提供，那么解决方案将受益匪浅。解决方案需要能够发现测量对等点和测量目标，特别是支持主动测量响应者的测量目标，并且能够响应测量请求并根据需要反映测量流量。

8. Comparison with Current Solutions

8. 与当前解决方案的比较

There is no standardized solution for distributed autonomic detection of SLA violations. Current solutions are restricted to ad hoc scripts running on a per-node fashion to automate some administrator actions. There are some proposals for passive probe activation (e.g., DECON [DECON] and CSAMP [CSAMP]), but these do not focus on autonomic features.

对于SLA违反的分布式自主检测，没有标准化的解决方案。当前的解决方案仅限于以每节点方式运行的临时脚本，以自动化某些管理员操作。有一些关于被动探头激活的建议（例如，DECON[DECON]和CSAMP[CSAMP]），但这些建议并不侧重于自主功能。

9. Related IETF Work

9. 相关IETF工作

This section discusses related IETF work and is provided for reference. This section is not exhaustive; rather, it provides an overview of the various initiatives and how they relate to autonomic distributed detection of SLA violations.

本节讨论了相关的IETF工作，仅供参考。本节并非详尽无遗；相反，它提供了各种方案的概述，以及它们与SLA违反的自主分布式检测的关系。

1. LMAP: The Large-Scale Measurement of Broadband Performance Working Group standardizes the LMAP measurement system for performance management of broadband access devices. The autonomic solution could be relevant to LMAP because it deploys measurement probes and could be used for screening for SLA violations. Besides that, a solution to decrease the workload of human administrators in service providers is probably highly desirable.

1. LMAP：宽带性能大规模测量工作组标准化了宽带接入设备性能管理的LMAP测量系统。自主解决方案可能与LMAP相关，因为它部署了测量探针，并可用于筛选SLA违规行为。除此之外，减少服务提供商中人工管理员工作量的解决方案可能是非常可取的。

2. IPFIX: IP Flow Information Export (IPFIX) Working Group (now concluded) aimed to standardize IP flows (i.e., netflows). IPFIX uses measurement probes (i.e., metering exporters) to gather flow data. In this context, the autonomic solution for the activation of active measurement probes could possibly be extended to also address passive measurement probes. Besides that, flow information could be used in making decisions regarding probe activation.

2. IPFIX：IP流信息导出（IPFIX）工作组（现已结束），旨在标准化IP流（即网络流）。IPFIX使用测量探头（即计量导出器）收集流量数据。在这种情况下，激活主动测量探针的自主解决方案可能会扩展到也处理被动测量探针。除此之外，流量信息还可用于有关探针激活的决策。

3. ALTO: The Application-Layer Traffic Optimization Working Group aims to provide topological information at a higher abstraction layer, which can be based upon network policy, and with application-relevant service functions located in it. Their work could be leveraged to define the topology for network devices that exchange measurement data.

3. ALTO：应用层流量优化工作组的目标是在更高的抽象层上提供拓扑信息，该抽象层可以基于网络策略，其中包含与应用相关的服务功能。他们的工作可以用来定义交换测量数据的网络设备的拓扑结构。

10. IANA Considerations

10. IANA考虑

This document has no IANA actions.

本文档没有IANA操作。

11. Security Considerations

11. 安全考虑

The security of this solution hinges on the security of the network underlay, i.e., the Autonomic Control Plane. If the Autonomic Control Plane were to be compromised, an attacker could undermine the effectiveness of measurement coordination by reporting fraudulent measurement results to peers. This would cause measurement probes to be deployed in an ineffective manner that would increase the likelihood that violations of SLOs go undetected.

此解决方案的安全性取决于网络底层（即自主控制平面）的安全性。如果自主控制平面遭到破坏，攻击者可以通过向对等方报告欺诈性测量结果来破坏测量协调的有效性。这将导致以无效的方式部署测量探针，从而增加违反SLO的行为未被发现的可能性。

Likewise, the security of the solution hinges on the security of the deployment mechanism for autonomic functions (in this case, the autonomic function that conducts the service-level measurements). If an attacker were able to hijack an autonomic function, it could try to exhaust or exceed the resources that should be spent on autonomic measurements in order to deplete network resources, including network bandwidth due to higher-than-necessary volumes of synthetic test traffic generated by measurement probes. Again, it could also lead to reporting of misleading results; among other things, this could result in non-optimal selection of measurement targets and, in turn, an increase in the likelihood that service-level violations go undetected.

同样，解决方案的安全性取决于自主功能部署机制的安全性（在本例中，是执行服务级别度量的自主功能）。如果攻击者能够劫持自主功能，则可能会尝试耗尽或超过应用于自主测量的资源，以耗尽网络资源，包括由于测量探针生成的合成测试流量超过必要量而导致的网络带宽。同样，这也可能导致报告误导性结果；除其他外，这可能会导致度量目标的非最佳选择，进而增加服务级别违规未被发现的可能性。

12. Informative References

12. 资料性引用

[ACP] Eckert, T., Ed., Behringer, M., Ed., and S. Bjarnason, "An Autonomic Control Plane (ACP)", Work in Progress, draft-ietf-anima-autonomic-control-plane-13, December 2017.

[ACP]Eckert，T.，Ed.，Behringer，M.，Ed.，和S.Bjarnason，“自主控制平面（ACP）”，正在进行的工作，草稿-ietf-anima-Autonomic-Control-Plane-132017年12月。

[CSAMP] Sekar, V., Reiter, M., Willinger, W., Zhang, H., Kompella, R., and D. Andersen, "CSAMP: A System for Network-Wide Flow Monitoring", NSDI USENIX Symposium Networked Systems Design and Implementation, April 2008.

[CSAMP]Sekar，V.，Reiter，M.，Willinger，W.，Zhang，H.，Kompella，R.，和D.Andersen，“CSAMP：用于网络范围流量监控的系统”，NSDI USENIX网络化系统设计与实施研讨会，2008年4月。

[DECON] di Pietro, A., Huici, F., Costantini, D., and S. Niccolini, "DECON: Decentralized Coordination for Large-Scale Flow Monitoring", IEEE INFOCOM Workshops, DOI 10.1109/INFCOMW.2010.5466642, March 2010.

[DECON]di Pietro，A.，Huici，F.，Costantini，D.，和S.Niccolini，“DECON：大规模流量监测的分散协调”，IEEE信息通信研讨会，DOI 10.1109/INFCOMW.2010.5466642，2010年3月。

[P2PBNM-Nobre-2012] Nobre, J., Granville, L., Clemm, A., and A. Gonzalez Prieto, "Decentralized Detection of SLA Violations Using P2P Technology, 8th International Conference Network and Service Management (CNSM)", 8th International Conference on Network and Service Management (CNSM), 2012, <http://ieeexplore.ieee.org/xpls/ abs_all.jsp?arnumber=6379997>.

[P2PBNM-Nobre-2012]Nobre，J.，Granville，L.，Clemm，A.，和A.Gonzalez Prieto，“使用P2P技术分散检测SLA违规行为，第八届国际会议网络和服务管理（CNSM）”，第八届国际网络和服务管理会议（CNSM），2012年， <http://ieeexplore.ieee.org/xpls/ abs_all.jsp？arnumber=6379997>。

[RFC4148] Stephan, E., "IP Performance Metrics (IPPM) Metrics Registry", BCP 108, RFC 4148, DOI 10.17487/RFC4148, August 2005, <https://www.rfc-editor.org/info/rfc4148>.

[RFC4148]Stephan，E.“IP性能度量（IPPM）度量注册表”，BCP 108，RFC 4148，DOI 10.17487/RFC4148，2005年8月<https://www.rfc-editor.org/info/rfc4148>.

[RFC4656] Shalunov, S., Teitelbaum, B., Karp, A., Boote, J., and M. Zekauskas, "A One-way Active Measurement Protocol (OWAMP)", RFC 4656, DOI 10.17487/RFC4656, September 2006, <https://www.rfc-editor.org/info/rfc4656>.

[RFC4656]Shalunov，S.，Teitelbaum，B.，Karp，A.，Boote，J.，和M.Zekauskas，“单向主动测量协议（OWAMP）”，RFC 4656，DOI 10.17487/RFC4656，2006年9月<https://www.rfc-editor.org/info/rfc4656>.

[RFC5357] Hedayat, K., Krzanowski, R., Morton, A., Yum, K., and J. Babiarz, "A Two-Way Active Measurement Protocol (TWAMP)", RFC 5357, DOI 10.17487/RFC5357, October 2008, <https://www.rfc-editor.org/info/rfc5357>.

[RFC5357]Hedayat，K.，Krzanowski，R.，Morton，A.，Yum，K.，和J.Babiarz，“双向主动测量协议（TWAMP）”，RFC 5357，DOI 10.17487/RFC5357，2008年10月<https://www.rfc-editor.org/info/rfc5357>.

[RFC5474] Duffield, N., Ed., Chiou, D., Claise, B., Greenberg, A., Grossglauser, M., and J. Rexford, "A Framework for Packet Selection and Reporting", RFC 5474, DOI 10.17487/RFC5474, March 2009, <https://www.rfc-editor.org/info/rfc5474>.

[RFC5474]Duffield，N.，Ed.，Chiou，D.，Claise，B.，Greenberg，A.，Grossglauser，M.，和J.Rexford，“数据包选择和报告框架”，RFC 5474，DOI 10.17487/RFC54742009年3月<https://www.rfc-editor.org/info/rfc5474>.

[RFC6248] Morton, A., "RFC 4148 and the IP Performance Metrics (IPPM) Registry of Metrics Are Obsolete", RFC 6248, DOI 10.17487/RFC6248, April 2011, <https://www.rfc-editor.org/info/rfc6248>.

[RFC6248]Morton，A.，“RFC 4148和IP性能度量（IPPM）度量注册表已过时”，RFC 6248，DOI 10.17487/RFC6248，2011年4月<https://www.rfc-editor.org/info/rfc6248>.

[RFC6812] Chiba, M., Clemm, A., Medley, S., Salowey, J., Thombare, S., and E. Yedavalli, "Cisco Service-Level Assurance Protocol", RFC 6812, DOI 10.17487/RFC6812, January 2013, <https://www.rfc-editor.org/info/rfc6812>.

[RFC6812]Chiba，M.，Clemm，A.，Medley，S.，Salowey，J.，Thombare，S.，和E.Yedavalli，“思科服务水平保证协议”，RFC 6812，DOI 10.17487/RFC6812，2013年1月<https://www.rfc-editor.org/info/rfc6812>.

[RFC7011] Claise, B., Ed., Trammell, B., Ed., and P. Aitken, "Specification of the IP Flow Information Export (IPFIX) Protocol for the Exchange of Flow Information", STD 77, RFC 7011, DOI 10.17487/RFC7011, September 2013, <https://www.rfc-editor.org/info/rfc7011>.

[RFC7011]Claise，B.，Ed.，Trammell，B.，Ed.，和P.Aitken，“流量信息交换的IP流量信息导出（IPFIX）协议规范”，STD 77，RFC 7011，DOI 10.17487/RFC7011，2013年9月<https://www.rfc-editor.org/info/rfc7011>.

[RFC7297] Boucadair, M., Jacquenet, C., and N. Wang, "IP Connectivity Provisioning Profile (CPP)", RFC 7297, DOI 10.17487/RFC7297, July 2014, <https://www.rfc-editor.org/info/rfc7297>.

[RFC7297]Boucadair，M.，Jacquenet，C.和N.Wang，“IP连接配置文件（CPP）”，RFC 7297，DOI 10.17487/RFC7297，2014年7月<https://www.rfc-editor.org/info/rfc7297>.

[RFC7575] Behringer, M., Pritikin, M., Bjarnason, S., Clemm, A., Carpenter, B., Jiang, S., and L. Ciavaglia, "Autonomic Networking: Definitions and Design Goals", RFC 7575, DOI 10.17487/RFC7575, June 2015, <https://www.rfc-editor.org/info/rfc7575>.

[RFC7575]Behringer，M.，Pritikin，M.，Bjarnason，S.，Clemm，A.，Carpenter，B.，Jiang，S.，和L.Ciavaglia，“自主网络：定义和设计目标”，RFC 7575，DOI 10.17487/RFC7575752015年6月<https://www.rfc-editor.org/info/rfc7575>.

[RFC8250] Elkins, N., Hamilton, R., and M. Ackermann, "IPv6 Performance and Diagnostic Metrics (PDM) Destination Option", RFC 8250, DOI 10.17487/RFC8250, September 2017, <https://www.rfc-editor.org/info/rfc8250>.

[RFC8250]北埃尔金斯、右汉密尔顿和M.阿克曼，“IPv6性能和诊断指标（PDM）目标选项”，RFC 8250，DOI 10.17487/RFC8250，2017年9月<https://www.rfc-editor.org/info/rfc8250>.

Acknowledgements

致谢

We wish to acknowledge the helpful contributions, comments, and suggestions that were received from Mohamed Boucadair, Brian Carpenter, Hanlin Fang, Bruno Klauser, Diego Lopez, Vincent Roca, and Eric Voit. In addition, we thank Diego Lopez, Vincent Roca, and Brian Carpenter for their detailed reviews.

我们希望感谢Mohamed Boucadair、Brian Carpenter、Hanlin Fang、Bruno Klauser、Diego Lopez、Vincent Roca和Eric Voit提供的有益贡献、评论和建议。此外，我们感谢迭戈·洛佩兹、文森特·罗卡和布赖恩·卡彭特的详细评论。

Authors' Addresses

作者地址

Jeferson Campos Nobre University of Vale do Rio dos Sinos Porto Alegre Brazil

阿雷格里港的巴西州里约热内卢大学

   Email: jcnobre@unisinos.br

   Email: jcnobre@unisinos.br

Lisandro Zambenedetti Granvile Federal University of Rio Grande do Sul Porto Alegre Brazil

里奥格兰德联邦大学Sul阿雷格里港巴西

   Email: granville@inf.ufrgs.br

   Email: granville@inf.ufrgs.br

Alexander Clemm Huawei USA - Futurewei Technologies Inc. Santa Clara, California United States of America

Alexander Clemm Huawei USA-美国加利福尼亚州圣克拉拉市Futurewei技术有限公司

   Email: ludwig@clemm.org, alexander.clemm@huawei.com

   Email: ludwig@clemm.org, alexander.clemm@huawei.com

Alberto Gonzalez Prieto VMware Palo Alto, California United States of America

Alberto Gonzalez Prieto VMware Palo Alto，美国加利福尼亚州

   Email: agonzalezpri@vmware.com

   Email: agonzalezpri@vmware.com