Exciting Announcement! In celebration of launching our AI Certification, we’re thrilled to offer a 50% discount exclusively. Seize this unique chance—don’t let it slip by!

Categories
Open Packet Broker SONiC

Syncing Success: Elevating Network Monitoring with Time-Synced Excellence in the SONiC Landscape

In the dynamic landscape of network monitoring, Time Synchronization emerges as a pivotal force, particularly in industries where precise packet timing is paramount. This is evident in time-sensitive applications like algorithmic trading platforms, emergency response systems, and Telco network monitoring, where split-second decisions are imperative. It forms the bedrock for achieving optimal Quality of Service (QoS), fault detection-diagnosis, and security threat detection. From enhancing call detail record analysis to synchronizing subscriber experience monitoring, Time Synchronization emerges as the unsung hero, orchestrating precision and efficiency in the symphony of network operations.

Why do we need packet timestamping?

Precise timestamps help pinpoint delays, identify network bottlenecks, optimize routing, and ensure adherence to service-level agreements.

  1. 1.

    Detecting the congestion point on the path of a flow:
    Monitor packet delays at various points along the path by analyzing the corresponding packet timestamps. This helps in Jitter, Throughput analysis and packet loss detection as well.

  2. 2.

    Path Tracing:
    By examining timestamps at different network devices, administrators can trace the path of a flow and pinpoint specific devices or links where congestion is likely occurring.

  3. 3.

    Arrival sequence validation:
    Arrival sequence validation helps confirm that packets are reaching their destination in the correct order. Also helps in achieving the protocol compliance, avoid data corruption and reliability

  4. 4.

    Security incident investigation:
    In cybersecurity, timestamps are essential for investigating security incidents. Analyzing the timing of events helps in understanding the sequence of actions during an incident

  5. 5.

    Troubleshoot and debugging the network delays :
    Timestamps facilitate the correlation of events across different network devices, aiding in troubleshooting and debugging by establishing a chronological order of occurrences

  6. 6.

    Dynamic Path Adjustments:
    Implement dynamic path adjustments to reroute traffic away from congested paths. This adaptive approach helps in mitigating congestion dynamically.

By employing a packet timestamping feature, network administrators can effectively detect congestion points, network delays, threat issues allowing for proactive management and optimization of network performance. Regular monitoring and analysis are essential for maintaining a resilient and efficient network.

How are we enabling Network Administrators?

Open Packet Broker (OPB) is the industry’s first software-based containerized Network Packet Broker (NPB) application built on top of the open-source SONIC NOS to enable monitoring and security tools to access the network traffic. OPBNOS stands out with its support for packet timestamping. Leveraging modern ASIC capabilities, it allows users to configure timestamps per port or flow, providing unparalleled precision. Packet timestamps can be added at ingress/egress at every port. Achieving precise time synchronization in network packet broking can be accomplished through two essential methods.

  1. 1.

    Timestamping the packetsintercepted by the network packet broker devices is a fundamental approach. This involves assigning a precise time reference to each packet, allowing for accurate sequencing and analysis.

  2. 2.

    Synchronizing the network packet brokerswith the network time. This synchronization can be achieved through widely used protocols such as Network Time Protocol (NTP) or high-precision Precision Time Protocol (PTP).

Network operators would like to insert timestamp to all the packets ingressing from network ports and egressing out to tool ports.

Fig : Deployment representation of Time-Synchronized OPB Network

opbnos# conf t
opbnos(config)# timestamping enable | disable
opbnos# conf t
opbnos(config)# interface ethernet Ethernet1/1
opbnos(config-if)# timestamp enable stage ingress source-id NE1Eth1
opbnos# conf t
opbnos(config)# interface ethernet Ethernet2/1
opbnos(config-if)# timestamp enable stage egress source-id NE1Eth2

Fig : TimeStamp Configuration at Interface level in OPBNOS

OPBNOS also offers the packet timestamp decoder which helps in analyzing the packet capture dump and decode the timestamp info for the customers. Also, it is use-case driven where the analyzer can be extended to serve any specific use-cases post decoding in the future.

test@aviz ~ % python3 timestamp_decoder.py
Timestamp Data : 0xebb8a66c01a05bd592ba00f577980000000001a05bd59584005bbbdd Source-1 : Seconds 1665 and Nanoseconds 466981562 and origin id : 0x7abbcc Source-2 : Seconds 1665 and Nanoseconds 466982276 and origin id : 0x2dddee Time Difference : 0 Seconds and 714 Nanoseconds

Fig : TimeStamp Decoder to verify/test the time difference in Network.

Conclusion

In conclusion, the synergy of packet timestamping is the bedrock of the modern network monitoring world. Packet timestamping, with its precision, lends a temporal dimension to data, enabling meticulous analysis, troubleshooting, and compliance. When integrated seamlessly into any network monitoring using Open Packet Broker (OPB based on SONIC NOS) , this timestamp feature becomes invaluable, orchestrating the symphony of network operations.

Time is not just a metric; it’s the heartbeat of network resilience and innovation.

 

FAQs

For any further queries or more information, please don’t hesitate to contact us.

Categories
SONiC

Elevate Your Career with Aviz Certified SONiC Professional

Are you ready to take your networking career to the next level? Discover how the Aviz Certified SONiC Professional certification can unlock your potential in the revolutionary world of SONiC technology.

Today, we are delighted to unveil the Aviz Certified SONiC Professional (ACSP) certification, a testament to our dedication to improve the abilities of individuals in the fast-paced SONiC environment.

How is Aviz Networks Raising the Bar for Industry Standards?

With technology constantly evolving, the demand for skilled professionals capable of mastering intricate solutions becomes paramount. Aviz Networks recognises the essential contribution of SONiC (Software for Open Networking in the Cloud) in shaping the future of networking. Our pioneering certification programme seeks to set a distinctive standard for SONiC competence and superiority.

Mastering SONiC: A Comprehensive Certification Overview

Created particularly for network engineers working with SONiC, SONiC developers, SONiC testers wanting to enhance their abilities, and the professionals wanting to switch to SONiC, the Aviz Certified SONiC Professional certification serves as an essential bridge to the world of SONiC. It covers important concepts such as below of SONiC. 

  • Architecture
  • Fundamentals, Core & SONiC Infrastructure
  • Platform, Layer2 & Layer3 expertise
  • Debugging & Troubleshooting

How to Advance Your Career with SONiC Technology Certification

Being awarded this certification is more than just obtaining a qualification – it opens the door to career progression and industry recognition. As an innovator in SONiC technology, Aviz Networks provides this certification as a robust preparatory base for professionals to confidently venture into the SONiC industry.

Top Benefits of Aviz Networks’ SONiC Certification for Professionals

    • Holistic Knowledge: Our certification lays emphasis on all important SONiC concepts, ensuring well-rounded understanding.
    • Industry Recognition: Being the first in its field, this certification carries significant industry value and recognition.
    • Career path opportunities: This certification will help if you want to switch to SONiC development or testing, excel in companies migrating to SONiC.
    • Early mover advantage: SONiC is a relatively young NOS. It is the fastest growing open source NOS. Being in this Industry is somewhat similar to joining one of the giants when the networking industry was starting. And having ACSP is akin to obtaining one of the prestigious certifications in the very beginning.
    • Online test: The test is completely online. One can take it anytime and from anywhere.

    Begin Your SONiC Certification Journey 

    Here are some basic exam details:

    • Duration: 1 hour
    • Format: 50 multiple-choice questions covering various difficulty levels
    • No Negative Marking: We encourage learning and advancement without fear of penalties.

    How to prepare for the Aviz Certified SONiC Professional?

    There are multiple sources you can use to prepare for the SONiC Certification. SONiC being the fastest growing open source NOS, there is a wealth of knowledge on the Internet. Here are some of the sources you can use to prepare for your Aviz Certified SONiC Professional certification:

    • There are a lot of channels on YouTube, which cover the basics and SONiC configurations at various complexity levels. There are also the Open Source Community channels like OCP, NANOG etc.
    • Bootcamps: Apart from this, there are a lot of boot camps organized by various companies which cover various aspects of SONiC. Aviz also organizes several in-person and virtual bootcamps throughout the year. Go to aviznetworks.com -> Events for the information about the latest bootcamps.
    • Basic Network certification materials: ACSP is a SONiC certification. However the major role of SONiC is in handling Layer 2 and Layer 3 traffic. Hence the certification doesn’t only confine to pure SONiC skills in these areas. It is recommended to up your routing and switching skills to at least CCNP levels training. Apart from this you should know about Layer 4 concepts in detail. 
    • SONiC Command Line CLI page: A lot of questions in the ACSP certification need SONiC CLI. You must thoroughly prepare the SONIC Command Line Interface Guide in the community.
    • SONiC Architecture Guide and important links: Community maintains a very good SONiC Architecture Guide and there are a lot of links on that page. These pages must be gone thoroughly to be adequately prepared for this certification. 

    Step-by-Step Guide to Achieving Aviz Certification

    Following flow diagram summarizes the process of getting Aviz Certified SONiC Professional Certification. 

    How to Become an Aviz Certified SONiC Professional

    Are you ready to become an Aviz Certified SONiC Professional? Visit our certification page for complete exam details and registration. 

    Join the Conversation

    We recognize the power of community collaboration and dialogue. Join us on our YouTube Channel and follow our LinkedIn page to take part in discussions.

    Stay informed about our upcoming SONiC course from Aviz Networks.

    At Aviz Networks, we’re not just shaping the future of networking; we’re also dedicated to fostering the career development of individuals who seek to make a difference in this rapidly evolving field. The Aviz Certified SONiC Professional certification is more than just a validation of your skills; it’s your gateway to numerous opportunities in the world of SONiC technology.

    We eagerly anticipate witnessing your successes as you embark on this transformative journey in your professional endeavors.

    Categories
    Open Networking Enterprise Suite SONiC

    6 KPIs for your SONiC NetOps system to scale your networks efficiently – Learn how ONES telemetry module works to achieve it

    Categories
    SONiC

    Validate SONiC with high Quality Bar for Your Mission Critical Use Cases

    Try FTAS Latest 2.3 Release with EVPN, MC-LAG, ECMP and focus on Scaling

    Try Fabric Test Automation Suite (FTAS) 2.3 Latest Release  – a robust suite of test cases meticulously designed to evaluate the deployment readiness of SONiC. Our unwavering commitment to improvement has shaped FTAS, evolving through invaluable customer feedback and integrating new features in alignment with the latest SONiC releases.

    Exciting news on our journey – FTAS 2.3 is now live! This version brings a myriad of enhancements, all aimed at refining SONiC assessment and streamlining pre-deployment testing. Let’s explore the latest features and discover how they can enhance your testing experience!

    4 Exciting Features in FTAS 2.3 You Should Know

    1. Decoding the Technological Tapestry of Data Center Interconnect
    • Data Center Interconnect (DCI) orchestrates a seamless integration of cutting-edge technologies. EVPN (Ethernet VPN), VXLAN (Virtual Extensible LAN), and MCLAG (Multi-Chassis Link Aggregation Group) collaboratively contribute to the intricate dance of network orchestration. EVPN dynamically manages control plane operations, VXLAN extends virtual networks seamlessly, and MCLAG ensures redundancy and load balancing. This technical synergy within DCI forms a robust infrastructure, adept at addressing the challenges of interconnected data centers. Each technology plays a pivotal role in achieving fluidity and resilience in the overall network composition.In response to real-world customer deployments, the latest release of FTAS introduces a host of new test cases, enhancing the existing coverage and ensuring a comprehensive evaluation of your network infrastructure.
    1. The Advantages of BGP-EVPN with VXLAN in Data Center Communication
    • Seamless Connectivity:
      • VXLAN EVPN ensures uninterrupted data flow by extending Layer 2 connectivity over a Layer 3 network.
    • Efficient Workload Management:
      • Dynamically manages virtualized workloads and facilitates precise distribution of MAC and IP addresses.
    • Scalability and Flexibility:
      • Adapts to the evolving demands of modern data center deployments, providing a reliable solution for network virtualization.
    • Enhanced Connectivity in FTAS 2.3:
      • Explore diverse scenarios covered by VXLAN EVPN in FTAS 2.3, showcasing its ability to enhance network connectivity and foster resilience.
    • Discover the power of BGP-EVPN with VXLAN, contributing to seamless communication, efficient resource utilization, and enhanced data center performance. Explore FTAS 2.3 for a deeper insight into these advantages.
    EVPN/VXLAN Topology
    EVPN/VXLAN Topology

    Here are the VXLAN EVPN scenarios covered in FTAS 2.3.

    Feature Test Scenario
    BGP-EVPN-VXLAN EVPN VXLAN Configuration and verification
    EVPN VXLAN for known unicast, BUM traffic (eBGP) with RIF
    EVPN VXLAN for known unicast, BUM traffic (eBGP) with SVI
    EVPN VXLAN for known unicast traffic (eBGP) with link events and router failure – RIF
    EVPN VXLAN for known unicast traffic (eBGP) with link events and router failure – SVI
    EVPN VXLAN for known unicast traffic (eBGP) with link events and router failure – RPCH
    Symmetric IRB with EVPN eBGP-RIF (eBGP)
    Symmetric IRB with EVPN iBGP-RIF (iBGP)
    Symmetric IRB with EVPN iBGP-SVI
    Symmetric IRB with EVPN eBGP-RPCH
    Symmetric IRB with EVPN iBGP-RPCH
    Asymmetric IRB with EVPN eBGP
    Asymmetric IRB with EVPN iBGP
    1. What is MCLAG and How Does it Enhance Data Center Reliability?
    • MCLAG (Multi-Chassis Link Aggregation) stands as a beacon for enhancing data center reliability, fostering a resilient and redundant network infrastructure. In the face of a switch failure, MCLAG seamlessly transfers traffic to a backup switch, effectively preventing downtime. This technology ensures high availability, load balancing, and simplified network design, making it indispensable for critical applications where continuous connectivity is paramount.Discover the breadth of MCLAG use cases covered in FTAS 2.3, showcasing its ability to fortify data center networks and elevate reliability based on real-world customer deployments.
    L2 MCLAG
    L2 MCLAG
    L3 MCLAG
    L3 MCLAG

    FTAS 2.3 covers following MCLAG use cases based upon the customer deployments.

    Deployment Test Scenario
    L2 MCLAG MCLAG-L2 Steady State with port channel and vlan on peer link
    MCLAG-L2 Member Link Down
    MCLAG-L2 Peer Link Down
    MCLAG-L2 Active Reboot
    MCLAG-L2 Standby Reboot
    L3 MCLAG MC-LAG L3 validation using port channel configuration
    MC-LAG L3 validation, Bring down the member link of Spine1
    MC-LAG L3 keepalive link down
    MCLAG-L3 Active Reboot
    MCLAG-L3 Standby Reboot
    1. Maximizing Network Efficiency: A Deep Dive into ECMP Scaling & Resilience
    •  Mastering ECMP: Navigating Scaling and Resilience Essentials- 
    • Intelligent Traffic Distribution:
      • ECMP optimizes network performance by intelligently distributing traffic across equal-cost paths.
    • Enhanced Load Balancing:
      • In data centers, ECMP significantly enhances load balancing for efficient traffic distribution.
    • Increased Bandwidth Utilization:
      • ECMP maximizes efficiency by substantially increasing bandwidth utilization.
    • Fault Tolerance Assurance:
      • Ensures fault tolerance, maintaining network resilience in the face of disruptions.
    • Crucial in Modern Environments:
      • In dynamic environments, ECMP ensures efficient resource utilization and high availability.
    • FTAS 2.3 Advancements:
      • FTAS 2.3 introduces eBGP support, enhancing ECMP scalability for advanced network capabilities.
    • Explore ECMP eBGP Scenarios:
      • Delve into diverse ECMP eBGP scalability scenarios in FTAS 2.3 for insights into scalable and resilient networks.
    • Unlock the potential of ECMP Scaling & Resilience with FTAS, from foundational principles to advanced capabilities in FTAS 2.3.Here is the list of ECMP eBGP scalability scenarios added in FTAS 2.3.
    Test Scenario Description
    ECMP Scalability – eBGP support Scalability to 16 ECMP paths and 32K eBGP routes
    Scalability to 32 ECMP paths and 64K eBGP routes
    Scalability to 64 ECMP paths and 128K eBGP routes
    Scalability to <number of leaf1 ports> ECMP paths and 1K eBGP routes per path

    Conclusion

    FTAS 2.3 has arrived, brimming with enhancements to facilitate seamless SONiC assessment. This latest release introduces an array of new VXLAN EVPN scenarios, encompassing diverse test cases tailored for efficient data center communication. Furthermore, MCLAG use cases guarantee the establishment of resilient, redundant networks, providing a safeguard against downtime during switch failures. Notably, FTAS 2.3 extends its support for ECMP scalability with eBGP, elevating both network performance and fault tolerance.

    Dive into the specifics of these upgrades, ranging from VXLAN EVPN configurations to MCLAG and ECMP eBGP scalability scenarios. With its comprehensive coverage and improved capabilities, FTAS 2.3 promises to elevate your testing experience, ensuring your SONiC assessment is not just thorough but also enriched with advanced features.

    Schedule a Call with Our Team to Delve into FTAS. For comprehensive information before the scheduled call, visit our FTAS product page.

    Categories
    Open Networking Enterprise Suite SONiC

    ONES 2.0: SONiC Networks support with Integrated Slack and Zendesk

    Introducing ONES 2.0: Advanced Alerting Integrated with Slack & Zendesk

    In the ever-evolving landscape of network operations (NetOps), staying ahead of potential issues and ensuring seamless functionality is imperative. We’re thrilled to unveil a groundbreaking enhancement to our NetOps product ONES – an intelligent alerting and notification system that seamlessly integrates with Slack and Zendesk. This new feature harnesses the power of an Advanced Rule Engine to deliver proactive alerts and streamline incident management, revolutionizing the way teams handle network complexities.

    How Does ONES Intelligent Integration Simplify Network Infrastructure Management?

    NetOps professionals are consistently challenged with the task of monitoring and managing intricate network infrastructures. The integration of an Intelligent Rule Engine within our platform marks a significant leap forward in simplifying this process. By leveraging sophisticated algorithms and machine learning capabilities, this enhancement enables the automated detection of anomalies and potential issues within the network.

    ONES-based intelligent rule engine goes beyond simple threshold-based alerting to analyze vast amounts of network data and identify patterns that indicate potential issues. This intelligent approach to alerting helps reduce noise and ensures that network administrators are only notified about the most critical events, freeing up their time to focus on resolving issues rather than sorting through a barrage of irrelevant alerts.

    ONES allows users to provide webhooks to integrate Slack Messaging and Zendesk Ticketing for Rule Engine

    Slack Channel Name and URL
    Slack Channel Name and URL
    Zendesk Ticketing URL and API Token
    Zendesk Ticketing URL and API Token

    What Makes Proactive Alerting Essential for Swift Action?

    The key highlight of this integration is its ability to generate real-time alerts based on predefined conditions and thresholds. Whether it’s a sudden surge in network traffic, a device experiencing performance degradation, platform health, resource utilization, link changes or any other critical event, the system can instantly identify, categorize, and trigger alerts.

    These alerts are not merely generic notifications; they’re intelligently crafted to provide actionable insights. This means that your team receives specific and contextual information regarding the nature of the issue, its severity, and recommended actions, empowering them to take swift, informed measures.

    How can Slack and Zendesk integration enhance incident management?

    The integration extends beyond generating alerts; it seamlessly connects with popular collaboration tools like Slack and ticketing systems like Zendesk. Through these integrations, alerts are directly pushed to designated channels or individuals on Slack, ensuring immediate visibility across relevant teams.

    Moreover, the seamless linkage with Zendesk facilitates automatic ticket creation or updates when an alert is triggered. This streamlines incident management workflows, allowing teams to initiate prompt responses and maintain a comprehensive log of network-related incidents within their existing ticketing system.

    Rules: Creating a New Rule
    Rules: Creating a New Rule

    As part of supportability functions. ONES also offers users with ease to 

    • generate tech support dumps and syslogs for endpoints
    • access to switch via Console and SSH

    Login Credentials and Syslogs Count
    Login Credentials and Syslogs Count
    Login Prompt
    Login Prompt

    Key Benefits of Integrating ONES Rule Engine with Slack and Zendesk

    Not every situation can be addressed with a single solution. Recognizing this, our intelligent alerting system offers extensive customization options. Users can define and fine-tune rules according to their network’s specific requirements, adjusting thresholds, conditions, and notification channels as needed. This scalability ensures that the system adapts and grows with your network infrastructure.

    Key Benefits:

    • Reduced alert fatigue: By intelligently filtering out irrelevant alerts, you can minimize alert fatigue and ensure that network administrators are only notified about critical events.
    • Faster response times: With critical alerts delivered directly to Slack and Zendesk, your team can respond to network issues more quickly and efficiently.
    • Improved collaboration: Seamless integration with collaboration platforms fosters better communication and collaboration among team members, ensuring that everyone is kept informed and can contribute to resolving issues effectively.
    • Weekly Digest: Weekly digest gives an apparent summary of all the logs generated by Rule Engine, it provides summarized details including Metric Name, the type of the threshold, Device information with MAC Address and the device IPs, so the NetOps team can easily get a clear idea what actually triggered throughout the week 

    The Power of Innovative Integration: Enhancing Efficiency and Reliability in NetOps

    With this innovative integration, NetOps teams can bid farewell to reactive firefighting and embrace a proactive approach to network management. The intelligent rule engine significantly reduces the mean time to detect (MTTD) and mean time to respond (MTTR), enhancing operational efficiency and bolstering network reliability.

    Conclusion

    In an era where network stability drives business success, our intelligent alerting system, ONES 2.0, represents a paradigm shift in NetOps. The integration of an advanced rule engine with Slack and Zendesk empowers proactive network management, ensuring smooth operations and swift incident resolution.

    Step into the future of NetOps with our enhanced ONES 2.0, unlocking elevated efficiency and reliability in network infrastructure management. Stay ahead, act decisively, and elevate your network operations with our intelligent alerting integration.

    Our alert and notification system extends beyond Slack and Zendesk, applicable to any messaging app or ticketing system, ensuring you stay on top of priorities.

    To witness how ONES 2.0 can revolutionize your NetOps, schedule a demo or Contact us today for a firsthand experience!

    Categories
    Open Networking Enterprise Suite SONiC

    Aviz ONES 2.0: Closing in on the Reality of SONiC-based AI Fabrics

    As technology advances, several trends are emerging in the application of Generative AI for networking, paving the way for more intelligent and adaptive network infrastructures. Some notable trends include Predictive Network Analytics, AI-Enhanced QOS, Network Resource Optimization, Anomaly Detection, Simulation of Realistic Network Environments, Autonomous Network Operations. RoCE (RDMA over Converged Ethernet) can address several challenges posed to networking devices in the context of Generative AI. 

    This serves as the foundation for the AI fabric due its improved model training speed, optimized and reliable data movement and its compatibility with Ethernet networks. Effective monitoring of RoCE traffic becomes instrumental in maintaining seamless operations. 

    Another important technique, proactive congestion management is crucial for maintaining optimal performance, reliability, and efficiency. AI workloads often involve the exchange of large datasets and real-time communication between nodes. Network congestion can lead to performance degradation, slowing down data transfers and compromising the responsiveness of AI applications. By identifying and addressing potential congestion points before they impact performance, proactive congestion management helps prevent degradation in the performance of generative AI tasks. This ensures that AI models can operate at optimal speeds, meeting the demands of real-time or near-real-time processing needs.

     AI Fabric Insight- AI workload with GPU and DPU.
    AI Fabric Insight

    ONES – Crafted for SONiC based AI Fabric

    In the ever-evolving realm of generative AI networks, where the need for high-performance and low-latency communication takes center stage, ONES 2.0 is set to redefine network optimization. This latest release presents a state-of-the-art solution meticulously crafted to streamline network operations. ONES seamlessly incorporates advanced features such as Priority Flow Control (PFC) counters for RoCE support, and proactive congestion management based on port and per port queue utilization details. ONES supports the collection of the metrics aiding the SONiC-Fabrics with AI across multiple vendor platforms offering excellent scalability support and powerhouses the data collection process. It also seamlessly integrates with the ONES ecosystem – orchestration, visibility, and support for third-party APIs including REST and Prometheus – offering the go-to solution for streamlined management, comprehensive monitoring, and flexible interoperability in complex network environments.

    ONES Unveiling SONiC AI Fabrics & RoCE: A Visual Exploration 

    ONES collects a set of valuable metrics that is instrumental in monitoring RoCE (RDMA over Converged Ethernet) as it provides insights into the flow control mechanisms and helps ensure the efficient and reliable communication of RoCE-enabled networks.  

    How Does Metric Collection Empower AI Fabrics to Tackle Challenges?

    • Traffic Prioritization: These metrics reveal how different types of traffic are prioritized in the network. In RoCE, where low-latency communication is crucial, the ability to prioritize traffic ensures that RDMA operations and other critical data transmissions are given precedence.
    • Congestion Management: Help in monitoring and managing network congestion. RoCE networks can experience congestion, and PFC allows for the pause of non-critical traffic during congestion, preventing packet loss and ensuring the smooth operation of RDMA communication.

    Powering AI with PFC and Rx/Tx Watermark counters
    Powering AI with PFC and Rx/Tx Watermark counters
    • Quality of Service (QoS): RoCE networks often have specific QoS requirements. These metrics provide data on how well the network adheres to these QoS policies. Monitoring allows network administrators to ensure that RoCE traffic receives the necessary level of service, minimizing latency and optimizing performance.
    • Identifying Bottlenecks: ONES can highlight potential bottlenecks in the network. By monitoring the pause frames and PFC counters, administrators can identify areas of congestion or network inefficiencies that may impact RoCE performance.
    • Real-time Monitoring: Real-time monitoring done by ONES allows for immediate responsiveness to changes in network conditions. In RoCE environments, where rapid data transfers are common, timely identification and resolution of congestion issues contribute to maintaining low latency and high throughput.
    • Performance Optimization: Understanding these metrics enables administrators to optimize the performance of RoCE networks. By analyzing the data, adjustments can be made to network configurations, traffic prioritization, or resource allocation to enhance overall RoCE performance.
    • Capacity Planning: ONES metrics contribute to capacity planning by providing insights into how well the network can handle the current load and whether there is room for expansion. This is crucial for scaling RoCE networks to accommodate growing demands.
    Topology Overview of RoCE Traffic: Nodes representing RoCE devices are connected by lines, showing the flow of RDMA data.
    Figure 1: Topology Overview of RoCE Traffic

    In the RoCE Traffic Topology GUI view, the flow unfolds dynamically, revealing the interconnected pathways of RDMA over Converged Ethernet (RoCE) traffic. Nodes representing devices engaged in RoCE communication are linked by lines indicating the data exchange routes. The graphical representation allows for an intuitive understanding of the network’s structure, emphasizing the direct, low-latency pathways characteristic of RoCE

    Network landscape with PFC enabled interfaces: Blue dots represent RoCE-capable interfaces.
    Figure 2: RoCE Enabled Interfaces

    In the graphical user interface (Figure 2), a visual representation unfolds, showcasing the dynamic network landscape with PFC enabled interfaces. These interfaces, depicted in the intuitive display, highlight the integration of RDMA over Converged Ethernet (RoCE) capabilities. The interfaces identified by a blue dot have the capability to transport RoCE traffic.

    Figure 3 depicts various provisions facilitating RoCE support on a device. In this case, the device is handling L3 lossless traffic on queues 3 and 4 of interface number 51.

    QOS Configuration for RoCE support on a device. Device handles L3 lossless traffic on queues 3 and 4 of interface 51
    Figure 3: QOS Configuration

    Figure 4 below in ONES depicts the distribution of RoCE traffic alongside regular traffic on the interface along with the seamless transmission of lossless data even in congested conditions, revealing the count of pause frames sent/received by the device.

    RoCE Traffic Segregation & PFC Counters: Visualizes RoCE and regular traffic distribution, ensuring lossless data transmission in congested conditions
    Figure 4: RoCE Traffic Segregation  & PFC Counters

    Queue drop counters play a pivotal role in AI Fabrics, offering crucial insights into the network’s performance and reliability. These counters specifically track instances where packets are dropped within the queuing system, providing valuable data for monitoring and optimization

    QoS Drop Counters: Monitors network performance and reliability by tracking dropped packets
    Figure 5: QoS Drop Counters

    Conclusion

    Based on the presented GUI snapshots, it’s evident that ONES offers a captivating visual experience, showcasing intricately designed software crafted explicitly for the AI Fabric on the SONiC platform. ONES doesn’t just fulfill the requirements of contemporary networking; it also enhances user interaction through intuitive visualization and advanced features. This platform signifies an innovative approach to orchestrating and visualizing networks across multiple vendors, delivering a customized solution for addressing the intricate nature of AI Fabric on the SONiC platform.

    What’s next in store for our forthcoming blog series, where we’ll extensively explore these informative topics:

    • Detailed security compliance with ONES
    • In-depth analysis regarding the measurement of NWSLA

    To immerse yourself in SONiC firsthand, visit ONES Center. Delve into a comprehensive case study of SONiC, please check out “Maximizing Success with SONiC”.

    FAQs

    1. What is RoCE and why is it important for AI fabric networks?

     RoCE (RDMA over Converged Ethernet) is a networking technology that enables high-throughput, low-latency communication by allowing direct memory access over Ethernet networks. It is crucial for AI fabric networks because it improves model training speeds, supports efficient data movement, and minimizes latency – all essential for real-time or near-real-time AI workloads.

    ONES 2.0 enhances RoCE traffic monitoring by collecting critical metrics like PFC counters, Rx/Tx watermarks, and QoS drop counters. It enables real-time visibility into traffic prioritization, congestion points, and queue utilization, helping administrators proactively optimize performance, ensure lossless data flow, and maintain low-latency communication across AI workloads.

    Proactive congestion management helps identify and mitigate potential network bottlenecks before they impact performance. In AI workloads involving large datasets and real-time communication, this prevents degradation in model training or inference tasks, ensuring optimal speeds, reliability, and efficient resource utilization.

    ONES 2.0 supports multi-vendor SONiC fabrics by normalizing telemetry metrics and collecting RoCE-related data across various hardware platforms. It integrates with orchestration and third-party APIs (REST, Prometheus), offering centralized visibility, streamlined configuration, and seamless monitoring in diverse AI network environments.

    ONES provides an intuitive GUI for visualizing RoCE traffic flow across nodes, interface-level traffic segregation, QoS configuration, and pause frame statistics. Features like PFC counters, lossless traffic mapping, and queue drop insights help network operators understand and troubleshoot AI fabric performance at a granular level.

    ONES captures real-time prioritization metrics and visualizes queue behaviors. This ensures RDMA operations maintain low latency, even under high data loads—keeping AI training and inference tasks efficient and stable.

    Yes. ONES provides:

    • Historical and live bandwidth utilization
    • Queue drop trends

    Device saturation insights
    These help teams forecast demand, adjust configurations, and scale SONiC-based AI fabrics proactively.

    ONES offers detailed visualization of:

    • Pause frames
    • PFC-enabled interfaces

    L3 lossless queue mapping
    These tools are vital for ensuring zero packet loss in RoCE environments where retransmissions can cripple AI performance.

    ONES connects seamlessly to third-party systems through REST APIs and Prometheus exporters. This enables teams to correlate RoCE telemetry with application metrics and wider infrastructure data, enriching end-to-end observability.

    NetOps, DevOps, and AI infrastructure teams all gain from ONES’s unified telemetry, real-time dashboards, and multi-vendor abstraction—helping them resolve issues faster and align network behavior with AI performance targets.

    Categories
    Open Networking Enterprise Suite SONiC

    New Product Release Announcement: Aviz ONES 2.0 – A Milestone for SONiC Deployments

    ONES (Open Networking Enterprise Suite) stands as a robust network management and supportability solution explicitly designed to tackle the distinctive challenges associated with transitioning to SONiC, an open-source Networking Operating System. Our comprehensive suite serves as a multifaceted Network Orchestration, Visibility, and Assurance platform catering to the complexities of operating multi-vendor and multi-NOS Network Infrastructure.

    ONES, initially conceived as a comprehensive solution tailored for the challenges of SONiC migration, has evolved significantly. With the emergence of ONES 2.0, a remarkable milestone is achieved.

    Aviz ONES 2.0 release emerges as a beacon, making a significant milestone in becoming a genuine multi-vendor SONiC Deployments, operations and AI-Fabric platform.

    6 Key Features that Make Aviz ONES 2.0 a Game Changer:

    1. Deep Insights with Enhanced Observability:

    • Advanced Topology View: Gain insights into underlay, overlay, and RoCE traffic Visibility in AI Fabric with an advanced topology view.
    • Detailed Metrics: Access detailed Protocols, Health and Capacity metrics presented with time series graphs for enhanced visibility.
    • Comprehensive Pages: Navigate through enriched pages for Inventory, Analytics, and Software, ensuring a holistic network view.
    • Expanded Metrics: Unlock additional metrics for devices and interfaces, enhancing analytical capabilities.
    • Quick Access: Swiftly connect with devices through an SSH button in the interface Down Widget.
    • Firmware Insights: Gain deeper insights with a Firmware details widget embedded in the Software page.

    2. Rule Engine & Integrations:

    • Watcher Rules and Alerts: Implement Watcher Rules for devices and interfaces, coupled with configurable metrics.
    • Collaborative Integration: Strengthen collaboration with Zendesk and Slack integrations, ensuring seamless notifications.

    3. Enhanced Compliance & Device Management:

    • Efficient Device Management: Streamline device management with syslog extraction, Console access, and Non-SONiC device controls.
    • Firmware Focus: Access comprehensive firmware information directly in the Device details, enhancing inventory management.
    • Data Handling: Export or download inventory with ease, simplifying data management.

    4. Orchestration:

    • YAML Configuration Templates: Simplify configuration with YAML templates supporting features like BGP, L2/L3 MC-LAG, EVPN MultiHoming, and more.
    • Config Updates: Incremental updates for L2VNI/L3VNI configurations, ensuring efficiency in managing network changes.
    • Backup and Restore: Enjoy enhanced backup and restore options via a user-friendly UI, ensuring network stability.

    5. Network SLA:

    • Performance Monitoring: Ensure optimal network performance with CLI Backend support for Packet Loss and Latency metrics.
    • Endpoint Flexibility: Measure performance between any two end-points using ICMP or TCP, providing flexibility in performance monitoring.

    6. Product Security:

    Regular security scans, robust certificate management, user account management, RBAC implementation, LDAP integration, mutual TLS certificates.

    Conclusion:

    Aviz ONES 2.0 signifies a quantum leap in SONiC supportability, offering advanced monitoring, rule-based alerts, orchestrated configuration updates, and NW SLA monitoring with enhanced UI. Embrace the future of networking with Aviz ONES 2.0 – where innovation meets reliability.

    Keep an eye on this space for our upcoming blog series covering these insightful topics: 

    • ONES 2.0 Release features: Complete guide
    • Rulengine, Alerts, and Notifications
    • RoCE Traffic Visibility in AI Fabric
    • Detailed security compliance with ONES
    • In-depth analysis regarding the measurement of NWSLA

    Contact us for personalized guidance and tailored solutions to optimize your SONiC experience. Our team is here to assist you in navigating the complexities, answering your queries, and exploring the best SONiC solutions suited for your unique requirements.

    FAQs

    1. What are the key enhancements in Aviz ONES 2.0 for multi-vendor SONiC deployments?

    Aviz ONES 2.0 introduces deep observability, rule-based automation, enhanced compliance features, and advanced orchestration tools to simplify SONiC operations across multi-vendor environments. Key improvements include:

    • Advanced Topology View for underlay, overlay, and RoCE traffic visibility.
    • Expanded device and interface metrics with time-series graphs.
    • Watcher Rules & Alerts for proactive network monitoring.
    • YAML-based orchestration for seamless configuration management (BGP, EVPN MultiHoming, etc.).
    • Enhanced security & compliance with RBAC, LDAP integration, and mutual TLS.

    These features make ONES 2.0 a comprehensive AI-Fabric and SONiC operations platform for managing complex networking environments.

    ONES 2.0 provides a deep observability framework that enables real-time insights into multi-vendor SONiC networks. Enhancements include:

    • Detailed protocol health and capacity metrics displayed in a structured UI.
    • Firmware insights widget for software visibility.
    • Expanded inventory, analytics, and software pages for a holistic network view.
    • SSH quick-access button for seamless device troubleshooting.

    These enhancements enable network teams to gain complete visibility into SONiC-based infrastructures, ensuring optimal performance and proactive issue resolution.

    ONES 2.0 introduces robust compliance and security measures to ensure a safe and controlled SONiC environment:

    • Regular security scans to identify vulnerabilities.
    • Comprehensive RBAC (Role-Based Access Control) for precise user management.
    • LDAP integration & mutual TLS certificates for secure authentication.
    • Syslog extraction and centralized device management for compliance auditing.

    By integrating these features, ONES 2.0 ensures that organizations can deploy SONiC securely while meeting stringent compliance standards.

    ONES 2.0 simplifies network orchestration through:

    • YAML-based configuration templates supporting BGP, MC-LAG, EVPN MultiHoming, and more.
    • Incremental updates for L2VNI/L3VNI configurations, ensuring efficient changes.
    • Backup & restore capabilities via UI, minimizing downtime and improving operational efficiency.

    These features help organizations automate SONiC configurations at scale, reducing manual efforts and accelerating deployments.

    ONES 2.0 enhances SLA tracking and network performance monitoring through:

    • Packet loss and latency measurement tools to ensure network health.
    • ICMP and TCP-based endpoint flexibility, allowing performance testing between any two network points.
    • CLI backend support for real-time monitoring and troubleshooting.

    ONES 2.0 delivers enhanced topology views and underlay/overlay mapping that includes RoCE traffic paths. This is critical for AI workloads that need consistent low-latency, lossless Ethernet. Real-time protocol health and congestion insights help operators optimize GPU networking at scale.

    ONES 2.0’s Watcher Rules engine allows teams to:

    • Set threshold-based alerts on device metrics and interface health
    • Monitor protocol states

    Receive proactive notifications via Slack or Zendesk
    This ensures the right teams respond quickly to any network anomalies.

    Yes. ONES 2.0 supports hybrid networks with SONiC and non-SONiC devices. Its extended device management capabilities—like syslog extraction, console access, and firmware tracking—provide full observability across diverse vendor environments.

    By offering YAML-based orchestration, easy UI-driven backups and restores, and incremental configuration updates, ONES 2.0 streamlines day-to-day tasks. NetOps teams deploy changes faster, validate configurations in real time, and maintain SLA compliance with less manual effort.

    ONES 2.0 includes enterprise-grade security features like:

    • RBAC for precise user access control
    • LDAP integration for secure authentication
    • Mutual TLS certificates for encrypted communications
    • Automated security scans for compliance assurance
    Categories
    SONiC

    Why Normalizing SONiC for Multi-vendor Network Infrastructure is Important and How to Do It?

    A multi-vendor network strategy is a forward-thinking approach that enables organizations to harness the full potential of SONiC. It revolves around the adoption of SONiC as a common NOS while deploying hardware sourced from multiple vendors within the same infrastructure. One of its key advantages is the flexibility; organizations can select the best-suited hardware to meet specific requirements without being constrained by a single-vendor solution. Additionally, the strategy encourages healthy competition among vendors, leading to better pricing, feature innovation, and support options.

    While a multi-vendor network strategy brings a multitude of benefits to organizations seeking consistency and efficiency in their network deployments, it also entails normalization of SONiC, its operational data, tools, and processes to ensure a unified and standardized environment is available for administrators and operators. In this blog, we will define and elaborate on why and how to normalize SONiC across diverse hardware platforms and foster interoperability to achieve greater flexibility and agility in your network infrastructure.

    Why Normalize SONiC for Multi-Vendor Networks: Overcoming Challenges and Maximizing Benefits

    Today, every forward looking networking team, regardless of their size, is looking to increase the capacity and performance of their network infrastructure. Everyone wants to embrace the latest advancements in ASIC functionalities, and deploy the latest hardware and software that allows them to scale and control their infrastructure on their own terms. But relying on a single vendor for hardware can lead to lock-in, where a company becomes overly dependent on that vendor’s products, services, and ecosystems.

    While there are clear advantages to avoiding dependence on a single hardware vendor, it’s also important to note that managing multi-vendor deployments can bring its own set of challenges, such as increased complexity in network management, integration, and support. SONiC brings the layer that creates the balance between the benefits of vendor diversification with the operational challenges of a multi-vendor network strategy. But, even SONiC presents several challenges:

    1. Management Complexity: Vendors typically have their own versions, configurations, management tools, and interfaces for SONiC. This heterogeneity complicates infrastructure management and monitoring.
    2. Compatibility Risks: As each vendor updates their version of SONiC, there is increased risk of introducing compatibility issues with other components in the overall network.
    3. Integration Hurdles: The variance in SONiC versions from each vendor often leads to integration issues resulting from configuration/telemetry inconsistencies across platforms.
    4. Support Variability: Different vendors offer varying levels of support, both in terms of quality and responsiveness, this can make troubleshooting issues more complicated.
    5. Training Demands: Operators might need training for vendor specific hardware and software nuances, which can lead to longer onboarding times.

    Hence, for consistent behavior and operability of using SONiC across different hardware platforms, normalization is essential. Normalization brings:

    1. Single Quality Standards & Consistent Visibility: To ensure that the OS provides consistent behavior and features regardless of the underlying hardware, and operational metrics are consistent across different platforms.
    2. Ease of Deployment: To ensure operators can expect similar configurations, behaviors, and deployment processes across different hardware.
    3. Standardized Troubleshooting: To ensure the environment emits consistent logs and alerts regardless of the hardware it runs on.
    4. Common Training & Documentation: To ensure network professionals have access to common and consistent resources and behaviors don’t vary based on the underlying hardware.
    5. Community Collaboration: To ensure that effective collaboration can occur between the users, the vendors, and the open-source SONiC community.

    In essence, normalization of SONiC ensures that its benefits are fully realized without introducing unnecessary complexities and inconsistencies in the network operations. More importantly, normalization reinforces that the NOS layer doesn’t lean too heavily towards any particular hardware vendor’s idiosyncrasies.

    Achieving SONiC Consistency: Best Practices for Multi-Vendor Networks

    Achieving consistency and interoperability with SONiC across multiple hardware platforms requires a combination of best practices, rigorous testing, purpose-built tools, and unified support, together referred to as normalization of SONiC. Here are some steps, considerations, and recommendations to ensure a unified experience for multi-vendor SONiC deployments, and to help maximize the TCO savings from adopting SONiC.

    1. Comprehensive SONiC Evaluation
      Evaluating SONiC requires a comprehensive approach that considers both its technical capabilities and viability within your network infrastructure. Each organization has their own requirements for routing protocols, throughput, latency, reliability, telemetry, etc., including costs. You should start with creating a list of requirements that meets your network’s demands, and then look for options that fit your bill. Many organizations prefer POCs for such evaluations, but POCs can be expensive and time consuming, especially when they entail procurement of multiple hardware from different vendors. 

      At Aviz, we identified this problem, and created what we call the Open Networking Experience Center for SONiC (ONE Center). ONE Center is a lab environment designed to showcase the capabilities of SONiC across a wide range of hardware. It gives you hands-on experience with SONiC and an avenue for doing SONiC evaluations.

      In real-world scenarios without upfront investment into hardware. Aviz even provides SONiC Test reports specific to your evaluation criterias highlighting viable choices of Switches that may meet your needs.
    2. Rigorous Testing of SONiC Builds
      Contrary to popular belief, the community SONiC may not be the right choice for your organization. While the community version provides all the necessary baseline features, oftentimes your exact feature requirements can only be met by vendor specific distros. This is very typical of any open-source software as the overall progress of the community hinges upon the contributions from community members and vendors alike. 

      Hence, it is important to identify what version will work for you, but it is even more important to perform rigorous testing before and after deployment. We recommend testing SONiC builds for functionality, performance, and scalability across your selected platforms, specifically for the use cases pertaining to you. At Aviz, we have developed a test suite to ensure that quality standards for SONiC meet the expectations of our customers.

      Our Fabric Test Automation Suite (FTAS) is designed to verify SONiC for Data Center and Edge deployment readiness. Moreover, FTAS is a fully automated CI/CD based solution ready to plug into existing network infrastructures for pre, and post deployment testing scenarios. We have helped multiple customers in establishing a vendor-agnostic unified SONiC build pipeline that can be rigorously tested before and after deployment. In some cases, we even manage the customer GitHub repositories that host the builds normalized for our customer use cases.
    3. Unified Configuration Management
      Deploying SONiC in multi-vendor environments makes it critical to have a unified configuration management system that ensures network administrators can seamlessly deploy, modify, and maintain their configurations across multiple hardware platforms. 

      Leveraging tools like YANG models, organizations can standardize SONiC configurations and data structures, presenting a consistent interface to their teams. Furthermore, a unified approach helps in streamlining troubleshooting and auditing activities, eliminating inconsistencies that might arise from platform-specific configurations. 

      At Aviz, we have taken it one step further by developing a unified SONiC Fabric Manager, a part of our Open Networking Enterprise Suite (ONES) solution, which can help orchestrate and configure multi-vendor SONiC networks with ease. 

      ONES can create, configure various topology configurations for Leaf, Spine, and Super-spine layers, apply them, and validate them on your entire fabric automatically. It even allows you to compare running configs against your applied configs. Additionally, ONES boasts features such as ZTP that can help upgrade your switches with custom SONiC images, something that is extremely critical for keeping your SONiC versions up-to-date.
    4. Standardized SONiC APIs for Integration
      While the Switch Abstraction Interface (SAI) at the core of SONiC is designed to ensure a consistent API layer across various ASIC vendors and the NOS, unfortunately, the platform components that make up the switch, bring their own nuances, leading to inconsistencies that often introduce complexities in integrating SONiC with various NetOps tools. 
      Standardized APIs across SONiC versions play a pivotal role in ensuring streamlined NetOps integration to deliver management agility, automation, and unified analytics. Standardizing APIs across a diverse range of hardware platforms with multiple variants of SONiC running on them can be extremely tedious, and hence a deterrent to your integration requirements. This is another area where we have put in a lot of effort to deliver Standardized APIs for nearly every variant of SONiC to our customers. 

      Our ONES platform has built-in REST APIs for both Day 1 and Day 2 operations. These REST APIs can easily be integrated with your NetOps tools, eliminating the need for you to invest your time and resources into Standardizing APIs for your SONiC operations.
    5. Unified Monitoring System Across Platforms for SONiC Deployments
      As organizations move towards adopting open-source SONiC over proprietary networking solutions, the one thing that everyone seeks is a single-pane-of-glass for monitoring their multi-vendor infrastructure. Everyone expects a Cisco/Arista grade experience not only because of the ease of use they brought, but also because no one wants the added complexity of using multiple systems for each of their vendor specific components.

      In an environment where SONiC operates across various hardware vendors, the potential for disparate monitoring tools and fragmented visibility is high. A cohesive visibility across all hardware components simplifies monitoring, troubleshooting, and promotes proactive network management.

      Aviz ONES offers administrators a unified, comprehensive view of their entire SONiC network landscape. It streamlines workflows, eliminating the need to juggle multiple vendor-specific tools, maximizing operational efficiency. In essence, a single-pane-of-glass approach maximizes and ensures consistent, high-quality network performance across diverse SONiC deployments.
    Open Networking Enterprise Suite (ONES) by Aviz Networks
    ONES overview
    1. Common Support Channel Across Vendors (NVIDIA, Cisco, Edgecore, Marvel…)
      No one wants to deal with multiple support channels for their infrastructure. The decision to go multi-vendor SONiC brings the critical challenge of having to deal with multiple support teams, ticketing tools, processes, timelines, SLAs, and more. In the world of SONiC, where hardware from various vendors coexist, the ability to receive consistent, reliable, and comprehensive support across all vendors is of paramount importance. A unified support ensures that network administrators have a single channel to raise issues, or seek guidance regardless of who is the supplier of their hardware. 

      At Aviz, we have created that unified support channel for SONiC by working tirelessly with our customers for years and collaborating with nearly every major ASIC and Switch vendor in the industry. Our team of SONiC experts guarantees that the nuances and idiosyncrasies of each vendor’s platform is well-understood and addressed timely for our customers. 

      We are proud to have partnered with the likes of Broadcom, Celestica, Cisco, Edgecore, Marvell, Micas, Nvidia, Supermicro, Wistron, and many more to deliver the much needed unified support for the SONiC ecosystem. 

      In most cases, our global teams, available 24×7, are able to meet SLAs as low as 30 minutes, since our contracts with hardware vendors have enabled us to provide a seamless experience. Additionally, with all the experience and expertise we have amassed over the years, we have been able to automate the SONiC troubleshooting via our carefully crafted runbooks that are normalized across multiple platforms. 

    Conclusion

    As organizations embrace SONiC for their multi-vendor networks, achieving consistency becomes paramount. To ensure seamless migration to, and interoperability with SONiC:

    1. Conduct a comprehensive end-to-end evaluation of hardware options tailored to your needs
    2. Choose the right SONiC distributions, and ensure rigorous testing is performed
    3. Utilize a unified configuration management system to streamline deployments
    4. Utilize standardized APIs to facilitate efficient NetOps integrations
    5. Utilize unified monitoring solutions for multi-vendor SONiC management
    6. Identify a centralized support partner that can collaborate with all hardware vendors

    Adopting these strategies will maximize the benefits of multi-vendor SONiC deployments.

    Contact us today, how we can support your networking needs

    FAQs

    1. What are the challenges of managing multi-vendor SONiC networks?

    Managing multi-vendor SONiC networks introduces challenges such as management complexity, compatibility risks, integration hurdles, and varying levels of vendor support. These issues arise from differences in SONiC versions, configurations, and vendor-specific tools, which can complicate monitoring, troubleshooting, and scaling across a diverse hardware environment.

    Normalizing SONiC ensures consistent behavior, quality standards, and operational visibility across diverse hardware platforms. It reduces the complexities of managing different configurations, improves troubleshooting efficiency, and enables a unified experience for operators. Normalization fosters interoperability, ensuring that the SONiC environment works seamlessly across different vendors and devices.

    Normalizing SONiC allows organizations to avoid vendor lock-in by ensuring that the network operates consistently across hardware from multiple vendors. This enables companies to select the best hardware for their needs without being tied to a single vendor’s ecosystem, fostering flexibility and driving cost-effectiveness and innovation.

     Best practices for normalizing SONiC include:

    • Conducting comprehensive SONiC evaluations tailored to your network requirements.
    • Rigorous testing of SONiC builds to ensure compatibility and performance across different platforms.
    • Implementing a unified configuration management system using tools like YANG models.
    • Standardizing SONiC APIs for integration with NetOps tools.
    • Utilizing a unified monitoring solution to ensure consistent visibility across the network.
    • Partnering with a centralized support channel for all hardware vendors to streamline troubleshooting.

    A unified configuration management system simplifies network administration by ensuring consistent configurations across different hardware platforms. It streamlines deployments, troubleshooting, and auditing activities, reducing the complexity of managing platform-specific configurations and ensuring a more efficient network operation.

    Standardized SONiC APIs ensure consistent integration with NetOps tools across different hardware platforms. By eliminating inconsistencies in the API layer, these standardized interfaces provide seamless management, automation, and analytics capabilities, improving network efficiency and agility.

    A unified monitoring system provides a single-pane-of-glass view of the entire multi-vendor SONiC network. It simplifies monitoring, troubleshooting, and proactive network management by eliminating the need for multiple vendor-specific tools and enhancing operational efficiency across diverse hardware platforms.

    Rigorous testing ensures that SONiC builds meet the specific functionality, performance, and scalability requirements of the organization. By thoroughly testing SONiC across different hardware platforms, organizations can identify and resolve potential issues before deployment, ensuring a smooth and reliable deployment process.

    The ONE Center is a lab environment designed to showcase SONiC’s capabilities across a wide range of hardware. It allows organizations to evaluate SONiC in real-world scenarios without the upfront investment in hardware. It provides hands-on experience and generates SONiC test reports to help organizations make informed decisions about the best hardware for their network needs.

    Aviz Networks offers solutions like the Open Networking Enterprise Suite (ONES) and the Fabric Test Automation Suite (FTAS) to support the normalization of SONiC. These tools help with configuration management, testing, integration, monitoring, and provide a unified support channel across various vendors. With Aviz’s expertise, organizations can streamline their multi-vendor SONiC deployments and ensure consistent performance and support.

    Categories
    Fabric Test Automation Suite SONiC

    FTAS 2.0: Enabling Customer Driven Quality Standards for Multi-Vendor SONiC Deployments

    The Importance of Quality Standardization in Community SONiC Based Networks

    While traditional network deployments rely on proprietary solutions from switch vendors for quality assurance, the shift towards community SONiC presents a challenge: maintaining high-quality standards without turning SONiC into a proprietary system. The solution? FTAS (Fabric Test Automation Suite).

    FTAS empowers customers to manage the diversity of community SONiC with a simple, one-click quality check. It enables data centers to transition to Community SONiC confidently, ensuring that any vendor’s switch aligns perfectly with their specific requirements through standardized and automated FATS test cases.

    The evolution of FTAS from its initial focus on a few vendors and use cases to the more comprehensive FTAS 2.0 highlights our commitment to expanding support and adapting to a wider range of customer needs. This advancement underscores our dedication to fostering standardized quality assurance in the Community SONiC ecosystem.

    FTAS (Fabric Test Automation Suite) Overview

    Overview of Aviz Networks’ Fabric Test Automation Suite (FTAS)

    Four-device topology for testing scenarios in FTAS

    The Fabric Test Automation Suite by Aviz Networks is a groundbreaking testing solution, combining the consistency of SONiC’s CLI with advanced testing methodologies. It represents a significant advancement in switch testing, catering to the complex demands of modern network fabrics with a commitment to quality and performance. Following are a few innovations we have done with FTAS.

    • Comprehensive Testing: Using SONiC CLI: FTAS leverages the uniformity of SONiC’s Command Line Interface across various vendors, enabling extensive and thorough testing of switch functionalities. This ensures a robust validation of core switch operations within complex network infrastructures.
    • Integration with Jenkins for CI/CD Testing: The suite is compatible with Jenkins, facilitating a ready-to-use Continuous Integration/Continuous Deployment (CI/CD) pipeline tailored for Network Development Operations (NetDevOps). It supports testing on both physical switches and GNS3-based virtual appliances, enhancing its versatility as a true virtual system.
    • In-depth Fabric Analysis: FTAS is specifically designed for detailed analysis of switch behavior within network fabrics. It uses SONiC’s consistent CLI to explore and understand intricate fabric interactions, highlighting the suite’s capability for detailed network scrutiny.
    • Tailored for Modern Network Needs: The suite is developed with a focus on the evolving requirements of contemporary networks. It offers a flexible and adaptable toolset, poised to address the dynamic nature of modern network technology landscapes.
    • Focus on Performance, Reliability, and Scalability: Aviz Networks has a clear mission with FTAS – to enhance network performance, reliability, and scalability through rigorous and comprehensive testing. This focus ensures networks are not just meeting but exceeding performance expectations.
    • Commitment to On-going Excellence: FTAS embodies Aviz Networks’ dedication to providing high-quality testing solutions. It ensures optimal network performance and resilience, showcasing the company’s commitment to excellence in network testing.

    Optimizing Test Cases and Quality Quest Across the Entire SONiC Ecosystem with Major Vendors

    Open Networking Experience (ONE Center): SONiC evaluation sandbox for diverse hardware.

    Aviz Networks has always taken an inclusive and comprehensive approach working with all of our partners. Community comes with diversity and with that it comes with a lot of heavy lifting which is required to do normalization for complex test cases. Below is the high level process we used to achieve a standardized test suite which can now work on all the vendors registered in ONE center, and hence available for customers to to POC or setup a CI/CD lab of their own and start on NetDevOps journey.

    • Selective Test Case Automation: The team at Aviz Networks embarked on an automation journey, carefully selecting key test cases for automation. This selection process was thorough, ensuring that the chosen test cases were crucial for the automation objectives – typically selected by high value customer use cases.
    • Rigorous Cross-Platform Testing: The automated test cases were subjected to intensive testing across various platforms available in the Aviz lab. This included a range of platforms from industry leaders such as Cisco, NVIDIA, Edgecore, Wistron, and others, ensuring a comprehensive testing process.
    • Platform Independent (PI) Test Suite: A significant number of these test cases were identified as platform-independent. They demonstrated consistent behavior across all tested platforms. These were grouped into the Platform Independent suite, making them versatile for execution on any platform within the ecosystem.
    • Identification of Platform Dependent (PD) Test Cases: Some test cases showed variations in behavior across different platforms. This was due to differences in the outputs of show commands or the support of different feature sets by various switches. These test cases were categorized as Platform Dependent, acknowledging their specific applicability to certain platforms.

    In summary, the optimization of test cases across the SONiC ecosystem at Aviz Networks involved a strategic selection and categorization of test cases, ensuring effective automation across a wide range of platforms. This process led to the creation of both Platform Independent and Platform Dependent test suites, accommodating the diverse nature of the testing environment and the specific characteristics of different platforms.

    Customer Guide to Accessing and Using FTAS

    FTAS serves a dual role, catering to both switch vendors and end customers with its versatile applications. It’s a dynamic tool, continuously evolving to incorporate the latest in SONiC community developments.

    For Switch Vendors

    • R&D and Product Development: Utilized in the research and development phase, FTAS aids in crafting high-quality SONiC solutions.

    For End Customers

    • Proof of Concept (POC) Testing: As a subscription service, it enables end customers to conduct POCs. This facilitates testing of multi-vendor SONiC solutions, helping in informed decision-making regarding vendor selection.
    • CI/CD Lab Management: FTAS is also instrumental in managing Continuous Integration/Continuous Deployment (CI/CD) labs, streamlining NetDevOps processes for efficient SONiC deployment.

    Continuous Enhancement

    FTAS is not static; it’s a ‘living entity’ that regularly integrates new use cases, platforms, and updates from the SONiC community. This ensures users always have access to the most advanced and comprehensive tools available.

    If you are looking for a comprehensive and easy-to-use testing solution for your SONiC network, then FTAS is the perfect option for you. For more detailed information about the supported suites and their configurations, visit: FTAS Test Suite Configuration


    FTAS Reporting Mechanism: Delivering Comprehensive Test Insights

    Test Execution report- user-friendly HTML format with detailed test results and logs for transparent and efficient testing experience
    Overview of the FTAS Test Results – Execution Report

    The Next Level of Testing: Introducing FTAS 2.0 and its Exciting Features

    Working with customers and partners, we focused on below three items in FTAS release 2.0: 

    • Making it more robust for the existing use cases and adding new test cases as per new customers
    • Adding more platforms for higher speeds and feeds and also new ASICs/Platforms
    • Making the test cases more flexible so custom test cases can be executed

    Detailed list for 2.0 release can be found here

    Conclusion

    FTAS is an invaluable asset for corporations exploring SONiC or switch vendors seeking enhanced support. It revolutionizes testing by offering comprehensive coverage with reduced effort and expense. By eliminating the need for extensive teams, complex test planning, and manual execution, FTAS streamlines the testing process. It ensures accurate, efficient outcomes at a fraction of the traditional cost and effort.

    Ready to explore? Book a demo with us on FTAS, and how it works?

    FAQs

    1.How does FTAS standardize quality testing across different SONiC vendors?

    FTAS standardizes quality testing by using SONiC’s consistent Command Line Interface (CLI) to automate test cases across different switch vendors. It provides a platform-independent suite of test cases, ensuring that switches from various vendors can be tested under the same quality standards, leading to a seamless and interoperable multi-vendor SONiC deployment.

    Platform Independent (PI) test cases are those that perform consistently across different platforms, while Platform Dependent (PD) test cases are tailored to specific platforms and may show variations in behavior due to vendor-specific implementations. FTAS categorizes test cases into PI and PD to ensure accurate testing across diverse platforms within the SONiC ecosystem.

    FTAS is continuously updated to incorporate the latest SONiC community developments. It regularly adds new use cases, platforms, and test cases, ensuring that users always have access to the most advanced tools. This ongoing enhancement ensures that FTAS remains adaptable to the ever-evolving requirements of modern network technologies.

    FTAS provides user-friendly, detailed test execution reports in HTML format, offering transparency and efficiency in understanding test results. These reports include comprehensive logs, making it easy for users to track performance, identify issues, and ensure that SONiC deployments meet the necessary quality standards.

    FTAS 2.0 introduces several new features, including enhanced support for existing use cases, additional platforms for higher speeds and new ASICs, and improved flexibility for executing custom test cases. These updates ensure that FTAS 2.0 can address the evolving needs of customers and vendors, supporting more advanced and diverse SONiC deployments.

    6. What is FTAS 2.0 and how does it improve SONiC deployments?

    FTAS 2.0 is an enhanced version of the Fabric Test Automation Suite, designed to ensure consistent, high-quality performance across multi-vendor SONiC environments. It automates and standardizes test cases, providing detailed insights into switch behavior and fabric interactions. FTAS 2.0 improves upon the original by supporting more platforms, adding new test cases, and allowing greater flexibility for custom testing.

     FTAS simplifies the testing and quality assurance process for multi-vendor SONiC deployments by providing a one-click quality check for any vendor’s switch. It ensures that switches meet specific requirements through standardized, automated FATS test cases, making it easier for data centers to transition to SONiC confidently.

    Integrating FTAS with Continuous Integration/Continuous Deployment (CI/CD) pipelines streamlines testing within the NetDevOps process. This integration ensures continuous testing of SONiC deployments, enabling automated validation and faster identification of issues. FTAS is compatible with Jenkins, facilitating automated testing across both physical switches and virtual appliances.

     For switch vendors, FTAS supports research and development by validating SONiC solutions before deployment. It ensures the solutions meet the quality standards required for successful multi-vendor SONiC deployments. For end customers, FTAS enables Proof of Concept (POC) testing, which aids in informed decision-making when selecting vendors and evaluating SONiC solutions.

    FTAS plays a crucial role in NetDevOps by automating the testing and validation of SONiC deployments. It integrates seamlessly into CI/CD pipelines, allowing customers to conduct continuous testing, track quality metrics, and enhance their network operations with minimal manual intervention. This ensures that network deployments are efficient, reliable, and scalable.

    Categories
    SONiC

    The Old NOS is Dead! Long live the SONiC King!

    What happened to the trailblazers in SDN?

    The software-defined networking (SDN) craze began with the promise of a disaggregated Network Operating System (NOS). The ability to have hardware and silicon diversity, with carrier-class network performance and resiliency, took flight.

    Vyatta, Pica8, Pluribus, and Cumulus are some of the companies that emerged when SDN was new, and OpenFlow and OpenStack were all the rage.

    SDN attempts to reshape the Intersection between NetOPs and DevOPs

    Programmatic capabilities and CLIs driving network transformation towards 'as code' approach

    Programmatic capabilities and command-line interfaces (CLIs) are becoming increasingly popular. The network and its infrastructure are being influenced by the DevOps movement and are transitioning to an “as code” approach.

    As infrastructure starts to virtualize, DevOps teams are building overlays for their own rapid service expansion on top of the existing network. This has led to the creation of VXLAN. The worlds of NetOps and DevOps have collided as a result of this trend.

    The convergence of NetOps and DevOps occurs due to widespread infrastructure virtualization, giving rise to VXLAN

    Ring a bell?

    Now, let’s talk about the fate of those NOS players…

    • Vyatta – Acquired.
    • Pica8 – Receivership. Assets Acquired. Best of luck.
    • Pluribus – Acquired. Your sole hardware choice.
    • Cumulus – Acquired. Broadcom support discontinued. A familiar tale, just like Pluribus.

    Shifting Tides in the NOS Space: The Evolution and Future of SONiC with Microsoft and the Linux Foundation

    SONiC, alternatively known as Azure Software for Open Networking in the Cloud (SONiC), a cross between NetOps and DevOps, SONiC emerged unexpectedly from the heart of Microsoft, a source that might have been the least anticipated.

    Thank you to the Microsoft Azure team! 🙂

    Microsoft has handed over the development of SONiC to the Linux Foundation and Open Compute Platform (OCP). This move ensures that SONiC will remain open-source and will not be acquired or end-of-lifed. Ever. 

    The Linux Foundation will focus on the software component of SONiC and will work with OCP to align hardware and specifications like SAI.

    SONiC’s Rise to Dominance: Navigating the Competitive Landscape of Disaggregated NOS in the Network Market

    Initially, there were some reservations about SONiC; however, it has since become a dominant force powering many platforms, including Azure, Google, and META, just to name a few. At this point, SONiC momentum is unstoppable!

    It seems like the disaggregated Network Operating System (NOS) market has become highly competitive, with some NOS struggling to survive. The market is similar to a game of “NOS Thrones,” where only the most capable NOS will survive and claim market share dominance. This competition is not limited to data centers but also extends to hyperscalers and the edge (both access and the RAN/ORAN Cell Site).

    How to address the rapidly changing and disappearing NOS space with a long term solution?

    According to a report by 650 Group, the worldwide market for SONiC in data center switching is expected to exceed $4 billion in revenue by 2025. The report also suggests that SONiC’s role outside of hyperscalers will increase during the forecast period.

    The big technology conglomerate like Cisco has adopted SONiC on their 8000 series hardware and has partnered with Aviz Networks to support it for their customers. It is predicted that SONiC will have a market share of 40% by 2025, which is second only to Cisco.

    So the question is not “What do I do?”, but as Nick Fury would say, “What are you prepared to do?”! Let’s discuss the most effective solution to the Cumulus Broadcom problem and other disappearing NOS.

    You can either choose:

    a) Another NOS that can perform the required tasks with some degree of uncertainty.

    or

    b) Opt for a more reliable alternative that includes Management, Automation, Orchestration, AND Observability, along with Support for the NOS in one package that can go the distance, such as Aviz with SONiC.

    From Setbacks to Success: Embracing Aviz Networks + SONiC for Trustworthy NOS Solutions

    Hear ye, Hear ye, The old NOS are Dead! Long Live The SONiC King!

    The new SONiC King replaces outdated NOS

    I invite you to join me for a unique blog series. 

    You may recall my previous episodes in the Pica8 NOS land, where the receivership left customers in the lurch for months. I lived through it, and that Pica8 sting is still fresh. Similarly, Cumulus left a sting that still burns.

    Let’s turn that burn into a raging success! How do you regain trust after a bad situation? The same way you always do. Get on top of a better solution! The better, future-proof solution is Aviz Networks + SONiC.

    Aviz Networks and SONiC collaboration for network solutions

    I’ve gained invaluable insights, and from my experience, I can attest it’s never too late to make the right choice.

    Join me in this Blog Series and my exciting journey into Digital Transformation with SONiC at Aviz Networks.

    Discover the superpowers you never knew you had, simplifying and rapidly integrating SONiC into your ecosystem, whether you’re dealing with HyperScalers or others!

    Let me take you on the adventure that will supercharge your digital transformation into a performance-driven, budget-saving, AI-powered machine, all on a smaller budget. We will show you how to do more with less. Stay tuned for Episode 2! 😊

    FAQs

    1. Why are traditional NOS vendors like Cumulus and Vyatta disappearing from the market?

    Traditional NOS vendors struggled to deliver the full vision of open networking. Most offered partial disaggregation with proprietary software stacks, which led to vendor lock-in. Acquisitions and lack of future-proof strategies eventually pushed these vendors out, leaving customers unsupported and searching for alternatives.

    SONiC is a fully open-source, community-driven NOS originally developed by Microsoft and now maintained by the Linux Foundation and OCP. It supports true disaggregation, vendor neutrality, and consistent evolution. Unlike other NOS vendors, SONiC cannot be acquired or end-of-lifed, making it a reliable long-term solution.

    SONiC stands out due to its wide adoption (used by Azure, Google, Meta), support from major vendors (like Cisco), and its open-source nature. It delivers high availability, scalability, and security while avoiding the vendor lock-in associated with other disaggregated NOS providers

    Yes, Community SONiC undergoes one-click automated validation to confirm reliability.It is tested for scalability and performance across multiple real-world scenarios. Certification includes:

    • Detailed test reports 
    • Covering everything from Layer 2 and 3 to platform scaling

    With legacy NOS platforms being phased out and market dominance shifting, now is the optimal time to migrate to SONiC. Enterprises benefit from lower TCO, vendor flexibility, open-source innovation, and a robust support ecosystem—especially when paired with Aviz Networks’ solutions.

    Many early SDN companies offered partial disaggregation or tied their NOS to proprietary hardware. Without true openness or strong community backing, they struggled to scale and were eventually acquired or shut down—leaving customers stranded.

    SONiC stands out because it is:

    • Fully open-source and Linux Foundation-governed
    • Supported by hyperscalers and leading vendors

    Hardware-agnostic with production deployments at global scale
    This shields it from the “acquire-and-sunset” problem that plagued earlier NOS vendors.

    SONiC aligns with modern DevOps approaches by offering standardized APIs, modular architecture, and automation-ready tools. This allows teams to apply infrastructure-as-code principles, integrate VXLAN overlays, and deploy faster with fewer errors.

    Yes. SONiC’s flexible design and wide ASIC support make it a strong choice for edge sites, retail branches, and telco access. Coupled with ONES 2.0, organizations gain robust observability, automation, and compliance for reliable operations.

    Aviz provides vendor-neutral support, onboarding tools, deep observability, and orchestration with ONES—ensuring a smooth migration to an AI-ready, open networking architecture.

    Contact Us

    The Old NOS is Dead! Long live the SONiC King!

    What happened to the trailblazers in SDN? The software-defined networking (SDN) craze began with the promise of a disaggregated Network Operating System (NOS). The ability to have hardware and silicon diversity, with carrier-class network performance and resiliency, took flight. Vyatta, Pica8, Pluribus, and Cumulus are some of the companies that emerged when SDN was new, […]