Internet-Draft | Telemetry Data Manifest | October 2024 |
Claise, et al. | Expires 24 April 2025 | [Page] |
Network platforms use Model-driven Telemetry, and in particular YANG-Push, to continuously stream information, including both counters and state information. This document describes the metadata that ensures that the collected data can be interpreted correctly. This document specifies the Data Manifest, composed of two YANG data models (the Platform Manifest and the Data Collection Manifest). These YANG modules are specified at the network (i.e., controller) level to provide a model that encompasses several network platforms. The Data Manifest must be streamed and stored along with the data, up to the collection and analytics system, in order to keep the collected data fully exploitable by the data scientists.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 24 April 2025.¶
Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
Network platforms use Model-driven Telemetry (MDT), and in particular YANG-Push [RFC8641], to continuously stream information, including both counters and state information.¶
This document specifies what needs to be kept as metadata (i.e., the Data Manifest) to ensure that the collected data can still be interpreted correctly throughout the collection and network analytics toolchain. When streaming YANG-structured data with YANG-Push [RFC8641], the corresponding YANG module definition provides the semantic definition of the collected data nodes. While these semantics are absolutely required to correctly decode and interpret the data, understanding the context of the network platform and of the collection environment is equally important to interpret the data.¶
This document proposes the Data Manifest, which is composed of two YANG data models, namely, the Platform Manifest and the Data Collection Manifest, in order to keep the collected data exploitable by the data scientists.¶
The Platform Manifest contains information characterizing the platform streaming the telemetry information, while the Data Collection Manifest contains the required information to characterize how and when the telemetry information was metered.¶
The two proposed YANG modules in the Data Manifest do not expose much new information but rather define what should be exposed by a platform streaming or storing telemetry data. Some related YANG modules have been specified to retrieve the platform capabilities:¶
These related YANG modules are important to discover the capabilities before applying the telemetry configuration (such as on-change). Some of their content is part of the context for the streamed data.¶
This document covers only metadata about the collection context for the telemetry. The collected data is likely to be transformed into usable indicators for the network. The list of such transformation operations applied to the data is often called data lineage. Supplying the data lineage for the computed indicators is out of scope of this document.¶
We first present the module for the Platform Manifest in Section 3 and then the module for the Data Collection Manifest in Section 4. The full Data Manifest is obtained by combining these two modules. We explain in Section 5 how the Data Manifest can be retrieved and how collected data is mapped to the Data Manifest.¶
Streamed information from network platforms is used for network analytics, incident detection, and ultimately closed-loop automation. This streamed data can be stored in a database (sometimes called a big data lake) for further analysis.¶
As an example, a database could store a time series representing the evolution of a specific counter collected from a network platform. When analyzing the data, the network operator/data scientist must understand the context information for these data:¶
Characterizing the source used for producing the data (vendor, platform, and OS) is useful to complement the data. As an example, knowing the exact data source software specification might reveal a particularity in the observed data, explained by a specific bug, a specific bug fix, or simply a particular behavior. This is also necessary to ensure the reliability of the collected data. On top of that, in particular for YANG-Push [RFC8641], it is crucial to know the set of YANG modules supported by the platform, along with their deviations. In some cases, there might even be backwards-incompatible changes in native modules from one OS version to the next. This information is captured by the proposed Platform Manifest.¶
From a collection parameters point of view, the data scientists analyzing the collected data must know whether the counter was requested from the network platform as on-change or at a specific cadence. Indeed, an on-change collection explains why there is a single value as opposed to a time series. In case of periodic collection, the exact cadence might not be observable in the time series: the time series might report some values as 0 or might even omit some values. The reasons for this behavior may vary: for instance, the network platform was under stress, with an observation period too small compared to the minimum-observed-period [I-D.claise-netconf-metadata-for-collection]. Again, knowing the conditions under which the counter was collected and streamed (along with the platform details) helps draw the right conclusions. As an example, taking a value of 0 at face value might lead to the wrong conclusion that the counter dropped to zero. This document specifies the Data Collection Manifest, which contains the required information to characterize how and when the telemetry information was metered.¶
The goal of the current document is to define what needs to be kept as metadata (i.e., the Data Manifest) to ensure that the collected data can still be interpreted correctly.¶
When a new device is onboarded, operators must make sure that the new device streams data with YANG-Push, that the telemetry data is the right data, that the data is correctly ingested in the collection system, and finally that the data can be analyzed (compared with other similar devices). For the last point, the Data Manifest, which must be linked to the data up to the collection and analytics system, contains all the relevant information.¶
The concepts behind the data mesh (https://www.datamesh-architecture.com/) are:¶
The most relevant concept for this document is the "Data as a Product" principle. The Data Manifest fulfills this principle as the two YANG data models, Platform Manifest and the Data Collection Manifest, along with the data, provide all the necessary information in a self-describing way for easy consumption.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
Data Manifest: all the necessary data required to interpret the telemetry information.¶
Platform Manifest: part of the Data Manifest that completely characterizes the platform producing the telemetry information.¶
Data Collection Manifest: part of the Data Manifest that completely characterizes how and when the telemetry information was metered.¶
Figure 1 contains the YANG tree diagram [RFC8340] of the ietf-platform-manifest module. The tree diagram is obtained by reusing existing modules, such as ietf-yang-library [RFC8525], using the YANG Schema Mount mechanism [RFC8528]. In addition to the YANG module in Section 3.2, we specify the YANG-library instance to be mounted. We explain in Appendix C how the YANG tree is obtained.¶
The YANG module actually contains a list of Platform Manifests (in 'platforms/platform'), indexed by the identifier of the platform. That identifier should be defined by the network manager so that each platform has a unique id. There are several ongoing drafts about managing the inventory of the network [I-D.ietf-ivy-network-inventory-yang], [I-D.havel-nmop-digital-map] based on [RFC8345]. The platform-id should be the same as the identifier used in these drafts or the node id in [RFC8345]. As an example, the identifier could be the 'sysname' from the ietf-notification module presented in [I-D.tgraf-netconf-notif-sequencing]. The scope of this module is the scope of the data collection, i.e. a given network, therefore it contains a collection of Platform Manifests, as opposed to the device scope, which would contain a single Platform Manifest.¶
The Platform Manifest is identified by a set of parameters ('name', 'software-version', 'software-flavor', 'os-version', 'os-type') that are aligned with the YANG Catalog www.yangcatalog.org [I-D.clacla-netmod-model-catalog] so that the YANG Catalog could be used to retrieve the YANG modules a posteriori. The vendor of the platform can be identified via its name 'vendor' or its PEN number 'vendor-pen', as described in [RFC9371].¶
The Platform Manifest also includes the contents of the YANG Library [RFC8525]. That module set is particularly useful to analyze the XPath filters, as they are based on module names (see [RFC8639], page 47).¶
The Platform Manifest is obtained by specifying the new fields defined above and mounting the YANG library module, along with the YANG Revisions augmentations. Thus, the YANG Library part is not repeated in the YANG module for the Platform Manifest.¶
We provide in this section the code of the ietf-platform-manifest YANG module. Additionally, we provide the extension data file for YANG Schema Mount. The Platform Manifest MUST conform to the model obtained by combining these two specifications.¶
<CODE BEGINS> file "ietf-platform-manifest@2024-07-02.yang"
module ietf-platform-manifest {
  yang-version 1.1;
  namespace "urn:ietf:params:xml:ns:yang:ietf-platform-manifest";
  prefix p-mf;

  import ietf-yang-schema-mount {
    prefix yangmnt;
    reference
      "RFC8528: YANG Schema Mount";
  }

  organization
    "IETF OPSAWG (Network Configuration) Working Group";
  contact
    "WG Web:   <https://datatracker.ietf.org/wg/opsawg/>
     WG List:  <mailto:opsawg@ietf.org>
     Author:   Benoit Claise  <mailto:benoit.claise@huawei.com>
     Author:   Jean Quilbeuf  <mailto:jean.quilbeuf@huawei.com>
     Author:   Diego R. Lopez  <diego.r.lopez@telefonica.com>
     Author:   Ignacio Dominguez
                        <ignacio.dominguezmartinez@telefonica.com>
     Author:   Thomas Graf  <thomas.graf@swisscom.com>";
  description
    "This module describes the platform information to be used as
     context of data collection from a given network element. The
     contents of this model must be streamed along with the data
     streamed from the network element so that the platform context
     of the data collection can be retrieved later.

     The data content of this model should not change except on
     upgrade or patching of the device.

     The key words 'MUST', 'MUST NOT', 'REQUIRED', 'SHALL', 'SHALL
     NOT', 'SHOULD', 'SHOULD NOT', 'RECOMMENDED', 'NOT RECOMMENDED',
     'MAY', and 'OPTIONAL' in this document are to be interpreted as
     described in BCP 14 (RFC 2119) (RFC 8174) when, and only when,
     they appear in all capitals, as shown here.

     Copyright (c) 2022 IETF Trust and the persons identified as
     authors of the code.  All rights reserved.

     Redistribution and use in source and binary forms, with or
     without modification, is permitted pursuant to, and subject to
     the license terms contained in, the Revised BSD License set
     forth in Section 4.c of the IETF Trust's Legal Provisions
     Relating to IETF Documents
     (https://trustee.ietf.org/license-info).

     This version of this YANG module is part of RFC XXXX; see the
     RFC itself for full legal notices.";

  revision 2024-07-02 {
    description
      "Initial revision";
    reference
      "RFC xxxx: Title to be completed";
  }

  container platforms {
    config false;
    description
      "Top container including all platforms in scope. If this model
       is hosted on a single device, it should contain a single entry
       in the list. At the network level, it should contain an entry
       for every monitored platform.";
    list platform {
      key "id";
      description
        "Contains information about the platform that allows to
         identify and understand the individual data collection
         information.";
      leaf id {
        type string;
        description
          "Identifies a given platform on the network, for instance
           the sysname of the platform. The id has to be unique on
           the network.";
      }
      leaf name {
        type string;
        description
          "Model of the platform from which data is collected.";
      }
      leaf vendor {
        type string;
        description
          "Organization that implements that platform.";
      }
      leaf vendor-pen {
        type uint32;
        description
          "Vendor's registered Private Enterprise Number as described
           in RFC9371";
      }
      leaf software-version {
        type string;
        description
          "Name of the version of software. With respect to most
           network device appliances, this will be the operating
           system version. But for other YANG module implementation,
           this would be a version of appliance software. Ultimately,
           this should correspond to a version string that will be
           recognizable by the consumers of the platform.";
      }
      leaf software-flavor {
        type string;
        description
          "A variation of a specific version where YANG model support
           may be different. Depending on the vendor, this could be a
           license, additional software component, or a feature set.";
      }
      leaf os-version {
        type string;
        description
          "Version of the operating system using this module. This is
           primarily useful if the software implementing the module
           is an application that requires a specific operating
           system version.";
      }
      leaf os-type {
        type string;
        description
          "Type of the operating system using this module. This is
           primarily useful if the software implementing the module
           is an application that requires a specific operating
           system type.";
      }
      yangmnt:mount-point "yang-library";
    }
  }
}
<CODE ENDS>¶
<CODE BEGINS> file "platform-extension-data.xml"
<yang-library
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library"
    xmlns:ds="urn:ietf:params:xml:ns:yang:ietf-datastores">
  <module-set>
    <name>mountee-set</name>
    <module>
      <name>ietf-yang-status-conformance</name>
      <revision>2024-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-status-conformance
      </namespace>
    </module>
    <module>
      <name>ietf-datastores</name>
      <revision>2018-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-datastores
      </namespace>
    </module>
    <module>
      <name>ietf-yang-library</name>
      <revision>2019-01-04</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-library
      </namespace>
    </module>
    <import-only-module>
      <name>ietf-yang-types</name>
      <revision>2013-07-15</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-types
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-inet-types</name>
      <revision>2013-07-15</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-inet-types
      </namespace>
    </import-only-module>
  </module-set>
  <schema>
    <name>test-schema</name>
    <module-set>mountee-set</module-set>
  </schema>
  <datastore>
    <name>ds:running</name>
    <schema>test-schema</schema>
  </datastore>
  <datastore>
    <name>ds:operational</name>
    <schema>test-schema</schema>
  </datastore>
  <content-id>2</content-id>
</yang-library>
<modules-state
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library">
  <module-set-id>2</module-set-id>
</modules-state>
<schema-mounts
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-schema-mount">
  <mount-point>
    <module>ietf-platform-manifest</module>
    <label>yang-library</label>
    <shared-schema/>
  </mount-point>
</schema-mounts>
<CODE ENDS>¶
Figure 2 contains the YANG tree diagram [RFC8340] of the ietf-data-collection-manifest module. The module relies on the YANG Schema mount [RFC8528] to reuse existing YANG modules describing the current data collection status. We explain in Appendix C how the YANG tree is obtained.¶
The 'data-collections' container contains the information related to each YANG-Push subscription. As for the Platform Manifest, these subscriptions are indexed by the platform id, so that all subscriptions in the network can be represented in the module.¶
As most of the information related to YANG-Push subscriptions ([RFC8639] and [RFC8641]) is stored in the ietf-subscribed-notifications and ietf-yang-push modules, we mount these modules. These modules have a part common to all subscriptions of the platform, stored in the 'streams' and 'filters' containers. The information about the subscriptions themselves is stored in the 'subscriptions/subscription' list, indexed by subscription id.¶
In the subscription object, the 'current-period' indicates the period currently used between two updates. That leaf can only be present when the subscription is periodic. The current period might differ from the requested period if the platform implements a mechanism to increase the collection period when it is overloaded. Having the current period information is crucial to understand if telemetry is missing because of a bug or a packet loss or simply because it was dynamically adjusted by the platform.¶
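To illustrate why the current period matters to a consumer of the data, the sketch below classifies the gap observed between two successive updates of a periodic subscription. It is a minimal sketch and not part of this specification: the function name, the `tolerance` slack factor, and the returned labels are assumptions made for the example. Only the unit (centiseconds, as in the `yp:centiseconds` type) comes from the modules.

```python
def classify_gap(gap_cs, requested_period_cs, current_period_cs, tolerance=1.5):
    """Classify the gap (in centiseconds) between two successive updates.

    `tolerance` is a hypothetical slack factor that absorbs jitter; it is
    not defined by the Data Manifest.
    """
    if gap_cs <= requested_period_cs * tolerance:
        # The update arrived at the cadence that was requested.
        return "on-schedule"
    if gap_cs <= current_period_cs * tolerance:
        # Longer than requested, but consistent with the period the
        # platform reports it is currently using.
        return "period-adjusted-by-platform"
    # Longer than even the current period: updates are probably missing.
    return "possible-loss-or-bug"
```

Without the current period, the second and third cases cannot be told apart, which is the ambiguity the 'current-period' leaf is meant to remove.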
The current-period node is added by the module 'ietf-data-collection-manifest-statistics' presented in Section 4.2. This module augments the subscription list from the module 'ietf-subscribed-notifications'. It is mounted as well via the YANG Schema Mount mechanism. The module for the Data Collection Manifest is presented in Section 4.3.¶
Below is the code of the ietf-data-collection-manifest-statistics module, which augments the ietf-subscribed-notifications module to provide information needed for the Data Collection Manifest.¶
<CODE BEGINS> file "ietf-data-collection-manifest-statistics@2024-07-02.yang"
module ietf-data-collection-manifest-statistics {
  yang-version 1.1;
  namespace
    "urn:ietf:params:xml:ns:yang:ietf-data-collection-manifest-statistics";
  prefix dcs;

  import ietf-subscribed-notifications {
    prefix sn;
    reference
      "RFC 8639: A YANG Data Model for Subscriptions to
       Event Notifications";
  }
  import ietf-yang-push {
    prefix yp;
    reference
      "RFC 8641: Subscriptions to YANG Datastores.";
  }

  organization
    "IETF OPSAWG (Network Configuration) Working Group";
  contact
    "WG Web:   <https://datatracker.ietf.org/wg/opsawg/>
     WG List:  <mailto:opsawg@ietf.org>
     Author:   Benoit Claise  <mailto:benoit.claise@huawei.com>
     Author:   Jean Quilbeuf  <mailto:jean.quilbeuf@huawei.com>
     Author:   Diego R. Lopez  <diego.r.lopez@telefonica.com>
     Author:   Ignacio Dominguez
                        <ignacio.dominguezmartinez@telefonica.com>
     Author:   Thomas Graf  <thomas.graf@swisscom.com>";
  description
    "This module augments subscribed notification with the
     current-period statistics reporting the actual collection
     period.

     The key words 'MUST', 'MUST NOT', 'REQUIRED', 'SHALL', 'SHALL
     NOT', 'SHOULD', 'SHOULD NOT', 'RECOMMENDED', 'NOT RECOMMENDED',
     'MAY', and 'OPTIONAL' in this document are to be interpreted as
     described in BCP 14 (RFC 2119) (RFC 8174) when, and only when,
     they appear in all capitals, as shown here.

     Copyright (c) 2022 IETF Trust and the persons identified as
     authors of the code.  All rights reserved.

     Redistribution and use in source and binary forms, with or
     without modification, is permitted pursuant to, and subject to
     the license terms contained in, the Revised BSD License set
     forth in Section 4.c of the IETF Trust's Legal Provisions
     Relating to IETF Documents
     (https://trustee.ietf.org/license-info).

     This version of this YANG module is part of RFC XXXX; see the
     RFC itself for full legal notices.";

  revision 2024-07-02 {
    description
      "Initial revision";
    reference
      "RFC xxxx: Title to be completed";
  }

  augment "/sn:subscriptions/sn:subscription" {
    description
      "Add extra statistics about data collection";
    leaf current-period {
      when '../yp:periodic';
      type yp:centiseconds;
      description
        "Period between two successive data collections, in the
         current state. Might differ from the configured period
         when the platform might increase the period
         automatically when it is overloaded.";
    }
  }
}
<CODE ENDS>¶
We provide in this section the code of the ietf-data-collection-manifest YANG module. Additionally, we provide the extension data file for YANG Schema Mount. The Data Collection Manifest MUST conform to the model obtained by combining these two specifications.¶
<CODE BEGINS> file "ietf-data-collection-manifest@2024-07-02.yang"
module ietf-data-collection-manifest {
  yang-version 1.1;
  namespace
    "urn:ietf:params:xml:ns:yang:ietf-data-collection-manifest";
  prefix d-mf;

  import ietf-platform-manifest {
    prefix p-mf;
    reference
      "RFC XXXX: Title to be completed";
  }
  import ietf-yang-schema-mount {
    prefix yangmnt;
    reference
      "RFC8528: YANG Schema Mount";
  }

  organization
    "IETF OPSAWG (Network Configuration) Working Group";
  contact
    "WG Web:   <https://datatracker.ietf.org/wg/opsawg/>
     WG List:  <mailto:opsawg@ietf.org>
     Author:   Benoit Claise  <mailto:benoit.claise@huawei.com>
     Author:   Jean Quilbeuf  <mailto:jean.quilbeuf@huawei.com>
     Author:   Diego R. Lopez  <diego.r.lopez@telefonica.com>
     Author:   Ignacio Dominguez
                        <ignacio.dominguezmartinez@telefonica.com>
     Author:   Thomas Graf  <thomas.graf@swisscom.com>";
  description
    "This module describes the context of data collection from a
     given network element. The contents of this model must be
     streamed along with the data streamed from the network
     element so that the context of the data collection can
     be retrieved later.

     This module must be completed with
     ietf-platform-manifest
     to capture the whole context of a data collection session.

     The key words 'MUST', 'MUST NOT', 'REQUIRED', 'SHALL', 'SHALL
     NOT', 'SHOULD', 'SHOULD NOT', 'RECOMMENDED', 'NOT RECOMMENDED',
     'MAY', and 'OPTIONAL' in this document are to be interpreted as
     described in BCP 14 (RFC 2119) (RFC 8174) when, and only when,
     they appear in all capitals, as shown here.

     Copyright (c) 2022 IETF Trust and the persons identified as
     authors of the code.  All rights reserved.

     Redistribution and use in source and binary forms, with or
     without modification, is permitted pursuant to, and subject to
     the license terms contained in, the Revised BSD License set
     forth in Section 4.c of the IETF Trust's Legal Provisions
     Relating to IETF Documents
     (https://trustee.ietf.org/license-info).

     This version of this YANG module is part of RFC XXXX; see the
     RFC itself for full legal notices.";

  revision 2024-07-02 {
    description
      "Initial revision";
    reference
      "RFC xxxx: Title to be completed";
  }

  container data-collections {
    config false;
    description
      "Contains the configuration and statistics for the collected
       data, per node in the network.";
    list data-collection {
      key "platform-id";
      description
        "Defines the information for each collected object";
      leaf platform-id {
        type leafref {
          path "/p-mf:platforms/p-mf:platform/p-mf:id";
        }
        description
          "Id of the platform collecting the data. This id is the
           same as the one in the platform manifest.";
      }
      yangmnt:mount-point "yang-push-collection";
      // augment here with other kind of collection items
    }
  }
}
<CODE ENDS>¶
<CODE BEGINS> file "data-collection-extension-data.xml"
<yang-library
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library"
    xmlns:ds="urn:ietf:params:xml:ns:yang:ietf-datastores">
  <module-set>
    <name>mountee-set</name>
    <module>
      <name>ietf-subscribed-notifications</name>
      <revision>2019-09-09</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-subscribed-notifications
      </namespace>
    </module>
    <module>
      <name>ietf-yang-push</name>
      <revision>2019-09-09</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-push
      </namespace>
    </module>
    <module>
      <name>ietf-data-collection-manifest-statistics</name>
      <revision>2024-07-02</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-data-collection-manifest-statistics
      </namespace>
    </module>
    <module>
      <name>ietf-datastores</name>
      <revision>2018-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-datastores
      </namespace>
    </module>
    <module>
      <name>ietf-yang-library</name>
      <revision>2019-01-04</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-library
      </namespace>
    </module>
    <import-only-module>
      <name>ietf-inet-types</name>
      <revision>2013-07-15</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-inet-types
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-interfaces</name>
      <revision>2018-02-20</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-interfaces
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-ip</name>
      <revision>2018-02-22</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-ip
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-netconf-acm</name>
      <revision>2018-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-netconf-acm
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-network-instance</name>
      <revision>2019-01-21</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-network-instance
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-restconf</name>
      <revision>2017-01-26</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-restconf
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-yang-patch</name>
      <revision>2017-02-22</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-patch
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-yang-types</name>
      <revision>2023-01-23</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-types
      </namespace>
    </import-only-module>
  </module-set>
  <schema>
    <name>test-schema</name>
    <module-set>mountee-set</module-set>
  </schema>
  <datastore>
    <name>ds:running</name>
    <schema>test-schema</schema>
  </datastore>
  <datastore>
    <name>ds:operational</name>
    <schema>test-schema</schema>
  </datastore>
  <content-id>2</content-id>
</yang-library>
<modules-state
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library">
  <module-set-id>2</module-set-id>
</modules-state>
<schema-mounts
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-schema-mount">
  <mount-point>
    <module>ietf-data-collection-manifest</module>
    <label>yang-push-collection</label>
    <shared-schema/>
  </mount-point>
</schema-mounts>
<CODE ENDS>¶
The Data Manifest MUST be streamed and stored along with the collected data. In case the collected data are moved to a different place (typically a database), the Data Manifest MUST follow the collected data. If that context is lost, for instance when the data is stored without the relevant information, the collected data can become unusable. The Data Manifest MUST be updated when the Data Manifest information changes, for example, when a router is upgraded, when a new telemetry subscription is configured, or when the telemetry subscription parameters change. The Data Manifest can itself be considered as a time series, and stored in a similar fashion to the collected data.¶
The collected data should be mapped to the Data Manifest. Since the Data Manifest will not change as frequently as the collected data itself, it makes sense to map several data points to the same Data Manifest. The collected data must therefore include metadata pointing to the corresponding Data Manifest. In case of a Data Manifest change, the system should keep the mapping between the data collected so far and the old Data Manifest, and not assume that the latest Data Manifest is valid for the entire time series.¶
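One possible way to realize this mapping is sketched below: every data point carries a reference to the manifest version that was in force when it was collected, and earlier versions are never overwritten. This is a minimal sketch under stated assumptions; the `ManifestRegistry` class, the `tag` helper, and the `manifest-ref` key are hypothetical names introduced for illustration and are not defined by this document.

```python
import copy


class ManifestRegistry:
    """Keeps every Data Manifest version and hands out stable references."""

    def __init__(self):
        self._versions = []  # index in this list = version reference

    def current_ref(self, manifest):
        """Return the reference for `manifest`, adding a version if it changed."""
        if not self._versions or self._versions[-1] != manifest:
            # Store a copy so later mutations do not rewrite history.
            self._versions.append(copy.deepcopy(manifest))
        return len(self._versions) - 1

    def resolve(self, ref):
        """Return the manifest version a data point was collected under."""
        return self._versions[ref]


def tag(datapoint, registry, manifest):
    """Return a copy of the data point with a pointer to its manifest."""
    return {**datapoint, "manifest-ref": registry.current_ref(manifest)}
```

Data points collected before a manifest change keep their old reference, so resolving them still yields the old Data Manifest rather than the latest one.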
The Platform Manifest is likely to remain the same until the platform is updated. Thus, the Platform Manifest only needs to be collected once per streaming session and updated after a platform reboot.¶
Similarly, the elements common to all subscriptions, such as the stream definitions and the common filters, might be updated less frequently than the subscriptions.¶
As this draft specifically focuses on giving context on data collected via streamed telemetry, we can assume that a streaming telemetry system is available. Retrieving the Data Collection Manifest and Platform Manifest can be done either by reusing that streaming telemetry system (in-band) or using another system (out-of-band), for instance by adding headers or saving manifests into a YANG instance file [RFC9195].¶
We propose to reuse the existing telemetry system (in-band approach) in order to reduce the effort of implementing this draft. To enable a platform supporting streaming telemetry to also support the Data Manifest, it is sufficient that this platform supports the models from Section 3 and Section 4. Recall that each type of manifest has its own approximate update frequency, i.e., at reboot for the Platform Manifest and at new subscription or CPU load variation for the Data Collection Manifest. The Data Manifest MUST be streamed with the YANG-Push on-change feature [RFC8641] (also called event-driven telemetry). Appendix A shows how the in-band approach would work while storing to a time-series database (TSDB).¶
The underlying time series database should accommodate the various rates at which different parts of the Data Manifest are updated. In particular, storing the Platform Manifest should be optimized to avoid duplicating repeated content, storing a new version only when there is a change in the manifest.¶
With YANG-Push, each notification sent by the device is part of a subscription, which is also one of the YANG keys used to retrieve the Data Manifest, the other key being the platform ID. In order to enable a posteriori retrieval of the Data Manifest associated with a datapoint, the collector must:¶
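A simple way to avoid storing repeated Platform Manifest content is to compare a digest of each received manifest with the digest of the last stored version. The sketch below assumes the manifest is available as a JSON-compatible Python dictionary; the `PlatformManifestStore` class and its `offer` method are hypothetical names for this example, and the in-memory list stands in for whatever TSDB is actually used.

```python
import hashlib
import json


def manifest_digest(manifest):
    """Digest of a manifest that does not depend on key order."""
    canonical = json.dumps(manifest, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class PlatformManifestStore:
    """Stores a Platform Manifest only when its content has changed."""

    def __init__(self):
        self.rows = []           # (timestamp, platform id, digest, manifest)
        self._last_digest = {}   # platform id -> digest of last stored version

    def offer(self, timestamp, platform_id, manifest):
        """Store the manifest if it differs from the last one; report whether it did."""
        digest = manifest_digest(manifest)
        if self._last_digest.get(platform_id) == digest:
            return False  # unchanged: nothing new to store
        self._last_digest[platform_id] = digest
        self.rows.append((timestamp, platform_id, digest, manifest))
        return True
```

Serializing with sorted keys means two manifests that differ only in member ordering produce the same digest and are treated as one version.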
With this information, to retrieve the Data Manifest from the datapoint, the following happens:¶
We don’t focus on the timing aspect as storing both the data and their manifest in a time series database (TSDB) will allow the data scientists to look for the Data Manifest corresponding to the timestamp of the datapoint. More precisely, given the timestamp of a collected datapoint, the query to the TSDB would be to get the last version of the Data Manifest before that timestamp. In that scenario, the reliability of the collection of the Data Manifest is the same as the reliability of the data collection itself, since the Data Manifest is like any other data.¶
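The lookup described above can be sketched as a search in a time-ordered list of manifest versions. This is an illustration only: the `manifest_at` function and the in-memory `history` list are assumptions standing in for a real TSDB query, and a manifest whose timestamp equals that of the datapoint is treated here as applicable to it.

```python
import bisect


def manifest_at(history, timestamp):
    """Return the manifest in force at `timestamp`, or None if there is none.

    `history` is a list of (timestamp, manifest) pairs sorted by timestamp.
    """
    times = [t for t, _ in history]
    # Index of the first version strictly newer than the datapoint.
    index = bisect.bisect_right(times, timestamp)
    if index == 0:
        return None  # the datapoint predates every stored manifest
    return history[index - 1][1]
```

For instance, with versions stored at timestamps 100 and 200, a datapoint collected at 150 resolves to the version stored at 100, not to the latest one.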
It is expected that the Data Manifest is streamed directly from the network equipment, along with YANG-Push [RFC8641] data. If the network equipment streaming telemetry does not yet support the YANG modules from the Data Manifest specified in this document, the telemetry collector could populate the Data Manifest from available information collected from the platform. However, this option requires effort on the telemetry collector side, as the information gathered in the Data Manifest proposed in this document could be scattered among various standard and vendor-specific YANG modules [RFC8199] that depend on the platform.¶
That Data Manifest should be kept and available even if the source platform is not accessible (from the collection system), or if the platform has been updated (new operating system or new configuration). The Platform Manifest is relatively stable and should change only when the platform is updated or patched. On the other hand, the Data Collection Manifest is likely to change each time a new YANG-Push subscription [RFC8641] is requested and might even change if the platform load increases and collection periods are updated. To separate these two parts, we enclose each of them in its own module.¶
Below is an example of both a Platform Manifest and corresponding Data Collection Manifests. The list of YANG modules in the yang-library container is kept empty for brevity.¶
<CODE BEGINS> file "manifests-example.json"¶
<CODE ENDS>¶
The file above contains the Data Collection Manifest for two XPath subscriptions. For the first one, with subscription id 4242, the exact semantics of the collected path (here, the administrative status of the network interfaces) can be obtained by looking up the module in the yang-library of the corresponding Platform Manifest, which gives the exact revision of ietf-interfaces used at collection time. In addition, the "on-change" container indicates that data is sent only when a change occurs; receiving no data therefore indicates that the administrative status of the interface did not change.¶
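The module lookup described above can be sketched as follows. This is a minimal illustration that takes the content of a yang-library container, laid out as a list of module sets each holding a list of modules as in ietf-yang-library [RFC8525], and returns the revision of a given module. The function name `module_revision` is hypothetical, and the sample content is illustrative only, since the example in this document leaves the module list empty.

```python
from typing import Optional


def module_revision(yang_library: dict, module_name: str) -> Optional[str]:
    """Return the revision of module_name found in a YANG library, or None.

    Only implemented modules ("module" entries) are searched;
    "import-only-module" entries are ignored in this sketch.
    """
    for module_set in yang_library.get("module-set", []):
        for module in module_set.get("module", []):
            if module.get("name") == module_name:
                return module.get("revision")
    return None


# Illustrative content only (assumption, not taken from the example file).
example_library = {
    "module-set": [
        {
            "name": "main-set",
            "module": [
                {"name": "ietf-interfaces", "revision": "2018-02-20"},
            ],
        }
    ]
}
print(module_revision(example_library, "ietf-interfaces"))
```

The returned revision identifies which version of the module defines the semantics of the collected path at collection time.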
The other Data Collection Manifest, with subscription id 4243, shows how a periodic subscription is reported. In that example, the current-period indicates that the requested period of 10s (1000 centiseconds) could not be sustained and is now 20s, for instance because the device is overloaded.¶
As we are reusing an existing telemetry system, the security considerations lie with the new content divulged in the new manifests. Appropriate access control must be associated with the corresponding leaves and containers.¶
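The arithmetic behind the periodic example can be sketched as follows. This is a minimal illustration, assuming that both the requested period and the current period are available as integers in centiseconds. The function names are hypothetical, and the way these two values are read from the manifest is left out.

```python
def centiseconds_to_seconds(value: int) -> float:
    """Convert a period expressed in centiseconds to seconds."""
    return value / 100


def period_degraded(requested_cs: int, current_cs: int) -> bool:
    """Return True when the effective period is longer than requested."""
    return current_cs > requested_cs


requested = 1000  # requested period from the example: 10 s
current = 2000    # current period from the example: 20 s
print(centiseconds_to_seconds(requested), centiseconds_to_seconds(current))
print(period_degraded(requested, current))
```

A consumer of the data can use such a check to avoid assuming one sample every 10s when the platform actually delivered one every 20s.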
The integrity and provenance of the data of the collection manifest can be ensured by a signing mechanism such as [I-D.lopez-opsawg-yang-provenance].¶
This document includes no request to IANA.¶
In this example, the goal is to collect the administrative status and number of received bytes for the interfaces of a fictional ACME device, and store the result in a time-series database (TSDB). The metrics are collected via YANG-Push, which is configured by specifying their XPaths and when they should be collected (periodically or on-change). More precisely, we want to collect "ietf-interfaces:interfaces/interface/enabled" on every change and "ietf-interfaces:interfaces/interface/statistics/in-octets" every 100 milliseconds. The paths here refer to the YANG module from [RFC8343]. The configuration of YANG-Push is out of scope for this document. Since they do not have the same trigger, each of the paths must be collected in its own subscription. Figure 4 presents an example of such a collection.¶
In the scenario from Figure 4, the collector receives YANG-Push notifications from the device and stores them in a TSDB. We first present a version without the Data Manifest and then show how to enrich it with the Data Manifest.¶
We use the notation from [I-D.kll-yang-label-tsdb] to represent how the data is stored in the TSDB. Without the Data Manifest, the result of the collection would be stored as shown in Figure 5. The "host" label indicates the device from which the data is collected, and the YANG keys are included as well. Here the interface "eth0" is enabled and has received 1234 octets. In that case, the value is stored without any way to know how it was obtained.¶
One possibility for keeping the Data Manifest with the data is to store it directly in the TSDB. In that case, the collector can subscribe to the data exported by the modules presented in this draft and store it like other metrics. For the Platform Manifest, assuming the platform ID is "PE1", the collector subscribes to the path "ietf-platform-manifest:platforms/platform[id=PE1]". For the Data Collection Manifests, the collector subscribes to the path "ietf-data-collection-manifest:data-collections/data-collection[platform-id="PE1"]/yang-push-collection/subscriptions/subscription[id=X]" where X is the subscription id of an existing subscription. With the approach from [I-D.kll-yang-label-tsdb], the corresponding subtrees would be split into a set of datapoints, one per leaf. Figure 6 shows two examples of storing leaves in a TSDB. The first leaf is the vendor Private Enterprise Number (PEN), which is part of the Platform Manifest. The second leaf is the XPath filter used for the subscription to the interface status.¶
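The two subscription paths given above can be produced by small helpers, sketched below. The function names are hypothetical. The path templates are copied from the text, including their quoting of the key values.

```python
def platform_manifest_path(platform_id: str) -> str:
    """Path to subscribe to for the Platform Manifest of one platform."""
    return f"ietf-platform-manifest:platforms/platform[id={platform_id}]"


def data_collection_manifest_path(platform_id: str,
                                  subscription_id: int) -> str:
    """Path to subscribe to for the manifest of one subscription."""
    return (
        "ietf-data-collection-manifest:data-collections/"
        f'data-collection[platform-id="{platform_id}"]/'
        "yang-push-collection/subscriptions/"
        f"subscription[id={subscription_id}]"
    )


print(platform_manifest_path("PE1"))
# One Data Collection Manifest path per existing subscription.
for sub_id in (4242, 4243):
    print(data_collection_manifest_path("PE1", sub_id))
```

With the two subscriptions of the earlier example, the collector would maintain one Platform Manifest subscription and two Data Collection Manifest subscriptions for "PE1".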
In the labels, the "host" might differ from the "platforms_platform_id" when the collector itself assembles the Data Manifest, i.e. for devices that do not natively support the Data Manifest. In that case, the value of this label could be the hostname of the collector. The host value does not matter for retrieving the Data Manifest, as the platform id is the meaningful field.¶
In our example, we can retrieve the Platform Manifest associated with a collected datapoint by looking for datapoints whose label "platforms_platform_id" is equal to the value of the host for that collected datapoint. In order to link a datapoint with the corresponding Data Collection Manifest, we need to add a label for the subscription id. For instance, the same datapoints as in Figure 5 could be stored as in Figure 7.¶
From the "interfaces_interface_enabled" datapoint, one can retrieve the corresponding Data Collection Manifest by looking for datapoints that have the label "data_collections_data_collection_yang_push_collection_subscriptions_subscription_id" equal to 4242.¶
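The label-based retrieval described above can be sketched as follows. This is only an illustration with an in-memory list standing in for the TSDB. The two manifest label names are those quoted in the text. The datapoint layout, the metric names of the manifest datapoints, and the "subscription_id" and "interfaces_interface_name" labels on the collected datapoint are assumptions made for this sketch.

```python
from typing import Any, Dict, List, Tuple

# Label names quoted in the text above.
PLATFORM_LABEL = "platforms_platform_id"
SUBSCRIPTION_LABEL = (
    "data_collections_data_collection_yang_push_collection"
    "_subscriptions_subscription_id"
)

Datapoint = Dict[str, Any]  # {"name": str, "labels": dict, "value": ...}


def with_label(store: List[Datapoint], label: str,
               value: str) -> List[Datapoint]:
    """Return the datapoints of store whose label equals value."""
    return [dp for dp in store if dp["labels"].get(label) == value]


def manifests_for(store: List[Datapoint], datapoint: Datapoint
                  ) -> Tuple[List[Datapoint], List[Datapoint]]:
    """Return (Platform Manifest, Data Collection Manifest) datapoints."""
    labels = datapoint["labels"]
    platform = with_label(store, PLATFORM_LABEL, labels["host"])
    # "subscription_id" is a hypothetical name for the added label.
    collection = with_label(store, SUBSCRIPTION_LABEL,
                            labels["subscription_id"])
    return platform, collection


enabled = {
    "name": "interfaces_interface_enabled",
    "labels": {"host": "PE1", "interfaces_interface_name": "eth0",
               "subscription_id": "4242"},
    "value": True,
}
store = [
    {"name": "platform_manifest_leaf",  # hypothetical metric name
     "labels": {"host": "PE1", PLATFORM_LABEL: "PE1"}, "value": "..."},
    {"name": "subscription_manifest_leaf",  # hypothetical metric name
     "labels": {"host": "PE1", SUBSCRIPTION_LABEL: "4242"},
     "value": "ietf-interfaces:interfaces/interface/enabled"},
    enabled,
]
platform, collection = manifests_for(store, enabled)
print([dp["name"] for dp in platform], [dp["name"] for dp in collection])
```

The collected datapoint itself is not returned by either lookup because it carries neither of the two manifest labels.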
Various optimizations are possible, such as relying on on-change subscriptions to update only the leaves that changed. In that way, the amount of data needed for updating and storing the Data Manifest in the TSDB would be limited.¶
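The effect of this optimization can be sketched on the storage side as follows. This is a minimal illustration, assuming the manifest has been flattened into a mapping from leaf path to value. The function name `changed_leaves` and the leaf paths used as keys are hypothetical, and leaves that disappear are not handled.

```python
from typing import Any, Dict

Leaves = Dict[str, Any]  # flattened manifest: leaf path -> value


def changed_leaves(previous: Leaves, current: Leaves) -> Leaves:
    """Return the leaves of current that are new or whose value changed."""
    return {
        path: value
        for path, value in current.items()
        if path not in previous or previous[path] != value
    }


before = {
    "subscription/4243/current-period": 1000,
    "subscription/4243/filter":
        "ietf-interfaces:interfaces/interface/statistics/in-octets",
}
after = dict(before, **{"subscription/4243/current-period": 2000})
# Only the leaf whose value changed needs to be written to the TSDB.
print(changed_leaves(before, after))
```

Here a single datapoint is written instead of the whole subtree when the collection period changes.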
v04 -> v05¶
v03 -> v04¶
v02 -> v03¶
v01 -> v02¶
v00 (WG adoption) -> v01¶
v05 -> v06¶
v04 -> v05¶
v03 -> v04¶
v02 -> v03¶
v01 -> v02¶
v00 -> v01¶
v00¶
This section provides the files needed to generate the YANG tree diagrams [RFC8340] from Figure 1 and Figure 2. The diagrams were obtained using yanglint (https://github.com/CESNET/libyang) version 2.1.80, using the YANG Schema Mount [RFC8528]. They were manually edited to remove parts irrelevant to this document, such as data nodes from imported modules, notifications and RPCs.¶
In order to get a tree diagram involving YANG Schema Mount with yanglint, two data files are required, in addition to the YANG module, its dependencies and the YANG modules to be mounted. First we need the extension data, containing the YANG library to use at the mount point. Then we need the YANG library to use at the top-level context. We provide below the commands used to get the "raw" YANG Tree diagrams from these files.¶
For the Platform Manifest, the extension data is provided in Section 3.2 as "platform-extension-data.xml". The top-level YANG library is included below as "platform-toplevel-yanglib.xml". The following command was used to obtain the YANG Tree diagram (before manual editing).¶
yanglint -f tree \
  -x platform-extension-data.xml \
  -Y platform-toplevel-yanglib.xml \
  ietf-platform-manifest@2024-07-02.yang¶
<CODE BEGINS> file "platform-toplevel-yanglib.xml"
<yang-library
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library"
    xmlns:ds="urn:ietf:params:xml:ns:yang:ietf-datastores">
  <module-set>
    <name>main-set</name>
    <module>
      <name>ietf-datastores</name>
      <revision>2018-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-datastores
      </namespace>
    </module>
    <module>
      <name>ietf-yang-library</name>
      <revision>2019-01-04</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-library
      </namespace>
    </module>
    <module>
      <name>ietf-yang-schema-mount</name>
      <revision>2019-01-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-schema-mount
      </namespace>
    </module>
    <module>
      <name>ietf-platform-manifest</name>
      <revision>2024-07-02</revision>
      <namespace>urn:ietf:params:xml:ns:yang:ietf-platform-manifest</namespace>
    </module>
    <import-only-module>
      <name>ietf-yang-types</name>
      <revision>2023-01-23</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-types
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-inet-types</name>
      <revision>2013-07-15</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-inet-types
      </namespace>
    </import-only-module>
  </module-set>
  <schema>
    <name>main-schema</name>
    <module-set>main-set</module-set>
  </schema>
  <datastore>
    <name>ds:running</name>
    <schema>main-schema</schema>
  </datastore>
  <datastore>
    <name>ds:operational</name>
    <schema>main-schema</schema>
  </datastore>
  <content-id>1</content-id>
</yang-library>
<modules-state
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library">
  <module-set-id>2</module-set-id>
</modules-state>
<CODE ENDS>¶
For the Data Collection Manifest, the extension data is provided in Section 4.3 as "data-collection-extension-data.xml". The top-level YANG library is included below as "data-collection-toplevel-yanglib.xml". The following command was used to obtain the YANG Tree diagram (before manual editing).¶
yanglint -f tree \
  -x data-collection-extension-data.xml \
  -Y data-collection-toplevel-yanglib.xml \
  ietf-data-collection-manifest@2024-07-02.yang¶
<CODE BEGINS> file "data-collection-toplevel-yanglib.xml"
<yang-library
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library"
    xmlns:ds="urn:ietf:params:xml:ns:yang:ietf-datastores">
  <module-set>
    <name>main-set</name>
    <module>
      <name>ietf-datastores</name>
      <revision>2018-02-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-datastores
      </namespace>
    </module>
    <module>
      <name>ietf-yang-library</name>
      <revision>2019-01-04</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-library
      </namespace>
    </module>
    <module>
      <name>ietf-yang-schema-mount</name>
      <revision>2019-01-14</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-schema-mount
      </namespace>
    </module>
    <module>
      <name>ietf-data-collection-manifest</name>
      <revision>2024-07-02</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-data-collection-manifest
      </namespace>
    </module>
    <module>
      <name>ietf-platform-manifest</name>
      <revision>2024-07-02</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-platform-manifest
      </namespace>
    </module>
    <import-only-module>
      <name>ietf-inet-types</name>
      <revision>2013-07-15</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-inet-types
      </namespace>
    </import-only-module>
    <import-only-module>
      <name>ietf-yang-types</name>
      <revision>2023-01-23</revision>
      <namespace>
        urn:ietf:params:xml:ns:yang:ietf-yang-types
      </namespace>
    </import-only-module>
  </module-set>
  <schema>
    <name>main-schema</name>
    <module-set>main-set</module-set>
  </schema>
  <datastore>
    <name>ds:running</name>
    <schema>main-schema</schema>
  </datastore>
  <datastore>
    <name>ds:operational</name>
    <schema>main-schema</schema>
  </datastore>
  <content-id>1</content-id>
</yang-library>
<modules-state
    xmlns="urn:ietf:params:xml:ns:yang:ietf-yang-library">
  <module-set-id>2</module-set-id>
</modules-state>
<CODE ENDS>¶
Thanks to Mohamed Boucadair, Tianran Zhou and Jan Lindblad for their reviews and comments.¶