Exciting Announcement! In celebration of launching our AI Certification, we’re thrilled to offer a 50% discount exclusively. Seize this unique chance—don’t let it slip by!

Categories
All

Aviz ONES 2.0: Closing in on the Reality of SONiC-based AI Fabrics

As technology advances, several trends are emerging in the application of Generative AI for networking, paving the way for more intelligent and adaptive network infrastructures. Some notable trends include Predictive Network Analytics, AI-Enhanced QOS, Network Resource Optimization, Anomaly Detection, Simulation of Realistic Network Environments, Autonomous Network Operations. RoCE (RDMA over Converged Ethernet) can address several challenges posed to networking devices in the context of Generative AI. 

This serves as the foundation for the AI fabric due its improved model training speed, optimized and reliable data movement and its compatibility with Ethernet networks. Effective monitoring of RoCE traffic becomes instrumental in maintaining seamless operations. 

Another important technique, proactive congestion management is crucial for maintaining optimal performance, reliability, and efficiency. AI workloads often involve the exchange of large datasets and real-time communication between nodes. Network congestion can lead to performance degradation, slowing down data transfers and compromising the responsiveness of AI applications. By identifying and addressing potential congestion points before they impact performance, proactive congestion management helps prevent degradation in the performance of generative AI tasks. This ensures that AI models can operate at optimal speeds, meeting the demands of real-time or near-real-time processing needs.

 AI Fabric Insight- AI workload with GPU and DPU.
AI Fabric Insight

ONES – Crafted for SONiC based AI Fabric

In the ever-evolving realm of generative AI networks, where the need for high-performance and low-latency communication takes center stage, ONES 2.0 is set to redefine network optimization. This latest release presents a state-of-the-art solution meticulously crafted to streamline network operations. ONES seamlessly incorporates advanced features such as Priority Flow Control (PFC) counters for RoCE support, and proactive congestion management based on port and per port queue utilization details. ONES supports the collection of the metrics aiding the SONiC-Fabrics with AI across multiple vendor platforms offering excellent scalability support and powerhouses the data collection process. It also seamlessly integrates with the ONES ecosystem – orchestration, visibility, and support for third-party APIs including REST and Prometheus – offering the go-to solution for streamlined management, comprehensive monitoring, and flexible interoperability in complex network environments.

ONES Unveiling SONiC AI Fabrics & RoCE: A Visual Exploration 

ONES collects a set of valuable metrics that is instrumental in monitoring RoCE (RDMA over Converged Ethernet) as it provides insights into the flow control mechanisms and helps ensure the efficient and reliable communication of RoCE-enabled networks.  

How Does Metric Collection Empower AI Fabrics to Tackle Challenges?

  • Traffic Prioritization: These metrics reveal how different types of traffic are prioritized in the network. In RoCE, where low-latency communication is crucial, the ability to prioritize traffic ensures that RDMA operations and other critical data transmissions are given precedence.
  • Congestion Management: Help in monitoring and managing network congestion. RoCE networks can experience congestion, and PFC allows for the pause of non-critical traffic during congestion, preventing packet loss and ensuring the smooth operation of RDMA communication.

Powering AI with PFC and Rx/Tx Watermark counters
Powering AI with PFC and Rx/Tx Watermark counters
  • Quality of Service (QoS): RoCE networks often have specific QoS requirements. These metrics provide data on how well the network adheres to these QoS policies. Monitoring allows network administrators to ensure that RoCE traffic receives the necessary level of service, minimizing latency and optimizing performance.
  • Identifying Bottlenecks: ONES can highlight potential bottlenecks in the network. By monitoring the pause frames and PFC counters, administrators can identify areas of congestion or network inefficiencies that may impact RoCE performance.
  • Real-time Monitoring: Real-time monitoring done by ONES allows for immediate responsiveness to changes in network conditions. In RoCE environments, where rapid data transfers are common, timely identification and resolution of congestion issues contribute to maintaining low latency and high throughput.
  • Performance Optimization: Understanding these metrics enables administrators to optimize the performance of RoCE networks. By analyzing the data, adjustments can be made to network configurations, traffic prioritization, or resource allocation to enhance overall RoCE performance.
  • Capacity Planning: ONES metrics contribute to capacity planning by providing insights into how well the network can handle the current load and whether there is room for expansion. This is crucial for scaling RoCE networks to accommodate growing demands.
Topology Overview of RoCE Traffic: Nodes representing RoCE devices are connected by lines, showing the flow of RDMA data.
Figure 1: Topology Overview of RoCE Traffic

In the RoCE Traffic Topology GUI view, the flow unfolds dynamically, revealing the interconnected pathways of RDMA over Converged Ethernet (RoCE) traffic. Nodes representing devices engaged in RoCE communication are linked by lines indicating the data exchange routes. The graphical representation allows for an intuitive understanding of the network’s structure, emphasizing the direct, low-latency pathways characteristic of RoCE

Network landscape with PFC enabled interfaces: Blue dots represent RoCE-capable interfaces.
Figure 2: RoCE Enabled Interfaces

In the graphical user interface (Figure 2), a visual representation unfolds, showcasing the dynamic network landscape with PFC enabled interfaces. These interfaces, depicted in the intuitive display, highlight the integration of RDMA over Converged Ethernet (RoCE) capabilities. The interfaces identified by a blue dot have the capability to transport RoCE traffic.

Figure 3 depicts various provisions facilitating RoCE support on a device. In this case, the device is handling L3 lossless traffic on queues 3 and 4 of interface number 51.

QOS Configuration for RoCE support on a device. Device handles L3 lossless traffic on queues 3 and 4 of interface 51
Figure 3: QOS Configuration

Figure 4 below in ONES depicts the distribution of RoCE traffic alongside regular traffic on the interface along with the seamless transmission of lossless data even in congested conditions, revealing the count of pause frames sent/received by the device.

RoCE Traffic Segregation & PFC Counters: Visualizes RoCE and regular traffic distribution, ensuring lossless data transmission in congested conditions
Figure 4: RoCE Traffic Segregation  & PFC Counters

Queue drop counters play a pivotal role in AI Fabrics, offering crucial insights into the network’s performance and reliability. These counters specifically track instances where packets are dropped within the queuing system, providing valuable data for monitoring and optimization

QoS Drop Counters: Monitors network performance and reliability by tracking dropped packets
Figure 5: QoS Drop Counters

Conclusion

Based on the presented GUI snapshots, it’s evident that ONES offers a captivating visual experience, showcasing intricately designed software crafted explicitly for the AI Fabric on the SONiC platform. ONES doesn’t just fulfill the requirements of contemporary networking; it also enhances user interaction through intuitive visualization and advanced features. This platform signifies an innovative approach to orchestrating and visualizing networks across multiple vendors, delivering a customized solution for addressing the intricate nature of AI Fabric on the SONiC platform.

What’s next in store for our forthcoming blog series, where we’ll extensively explore these informative topics:

  • Detailed security compliance with ONES
  • In-depth analysis regarding the measurement of NWSLA

To immerse yourself in SONiC firsthand, visit ONES Center. Delve into a comprehensive case study of SONiC, please check out “Maximizing Success with SONiC”.

Categories
All

ONES 2.0: Unleashes Rule Engine and Alerting System for Seamless SONiC Operations

ONES Rule Engine is an advanced feature that enhances your network management experience by providing a seamlessly integrated alert and notification system. It offers comprehensive monitoring metrics and allows you to create device and interface level rules with ease. With ONES Rule Engine, you can have tailored control over your network management. Upgrade your network management game and experience with ONES Rule Engine today!

10 Benefits of Using ONES Rule-Engine for Comprehensive Network Monitoring

  • Comprehensive Monitoring
    ONES Rule-Engine takes a holistic approach to network monitoring by keeping an eye on diverse metrics such as CPU utilization, Memory utilization, PSU status, fans speed, RX/TX, and more. This breadth of coverage ensures that no aspect of your network goes unnoticed, providing a comprehensive view for proactive issue resolution.
  • Device and Interface Level
    It allows the creation of rules at both device and interface levels. This fine grained rule management ensures that specific devices or interfaces can be targeted for rule application, allowing for a tailored approach to network optimization and issue handling.
  • Rule Customization
    Rule-Engine understands the unique requirements of different network components. With device-level rules based on Hardware SKU, Role, and OS version, administrators can fine-tune alerts to align with the specific characteristics of their network infrastructure.

Figure 1: Rule Configuration
Figure 1: Rule Configuration
  • Device Inclusion & Exclusion 
    Flexibility is key in network management. The rule engine provides the capability to include or exclude devices from rules, ensuring that the rule engine caters to the specific needs of your network architecture. This feature enables a dynamic response to changes in the network environment.
  • Severity-Based Alerting
    The Rule-Engine facilitates the creation of Critical and Warning severity alerts, allowing administrators to prioritize responses based on the urgency and impact of potential issues. This hierarchical alerting system ensures that critical problems are addressed promptly, minimizing downtime and optimizing network performance.
  • Alert Summary for Collaborative Issue Resolution
    The system enables users to generate a comprehensive report of all alerts, facilitating effortless sharing with the team. This feature simplifies the collaborative resolution process, promoting efficient communication and knowledge transfer among team members.

Figure 2: Alert Summary
Figure 2: Alert Summary
  • Integration with Slack for real-time notifications
    ONES’ Slack integration ensures that critical alerts are delivered directly to designated channels, keeping teams informed and in sync. Additionally, weekly Slack digests provide a comprehensive overview of alerts and Zendesk ticket details, streamlining communication and collaboration.
  • Zendesk Integration for Streamlined Ticketing
    The rule engine seamlessly integrates with Zendesk, automating the creation of tickets based on alerts. This integration simplifies the ticketing process, providing a centralized platform for tracking and managing network issues.
  • Preventing redundant alerts leads to efficient alerting
    During the rule creation process, administrators have the capability to specify the maximum number of alerts for a particular metric on a specific device, mitigating the occurrence of redundant notifications. This feature contributes to a streamlined and efficient alerting system, enhancing the overall effectiveness of network management within the ONES 2.0 ecosystem.
  • Strengthening Monitoring and Response Capabilities with detailed alert information
    Each alert is enriched with essential details, including Metric Name, Type(Critical or Warning), Triggered Time and Associated Rule Information. Alerts also includes a URL that will redirect users to associated visual representations for better understanding. In addition, device information such as IP address, role, region, SKU, serial number, NOS etc are the part of alert details. Interface specific alerts will have the related additional information like the interface name , speed , Transceiver details as shown in below image Fig 3.

Figure 3: Alerts details on Zendesk
Figure 3: Alerts details on Zendesk

Figure 4: Alerts Summary on Slack
Figure 4: Alerts Summary on Slack

Rule Engine coverage

  • System Health
    Rules can be created to monitor system health like device’s CPU utilization, Memory utilization and CPU core temperatures and alert if those values exceed the critical or warning thresholds. ONES UI also provides the recommended thresholds for CPU and memory usage.
  • Alert on Component Failures   
    Rule engine can be used to alert if a device FAN or a Power supply unit (PSU) goes faulty. ONES backend keeps continuous track of component health and triggers an alert in case of failure.
  • Capacity Monitoring 
    Hardware switching is an important aspect in today’s network for high speed data transmissions. Situations can develop where the switch ASIC hardware limits are utilized and forwarding happens in software causing system instability. ONES rule engines have these monitored as well and rules can be created to notify if the ASIC IPv4 / IPv6 utilization exceeds the warning and critical levels.
  • Traffic Monitoring
    Set the utilization levels for traffic links , acceptable thresholds for errors and discards and alerts will be generated for links crossing the set levels.
  • Transceiver Health
    Transceiver operational values like Voltage, Temperature and Power are critical for having error free and lossless transmissions. Rule engine monitors those metrics and alerts the transceivers that are on verge of going rogue or requiring attention.
  • SONiC Services Health
    In addition to all the above , alerts can be generated for any BGP neighboring going down and for monitoring synced and for container cpu utilization.

Conclusion

Embrace the power of ONES 2.0’s Rule Engine and Alerting system to elevate your network management experience. With real-time monitoring of hardware, network, components, counters and transceiver health to enhance your SONiC journey with unparalleled support and advanced alert management through Slack and Zendesk integrations.

The alerts system goes beyond Slack or Zendesk integrations and can be customized to fit any platform based on the requirements.

Stay tuned for our upcoming blog series, where we’ll dive deep into these insightful topics:

  • RoCE Traffic Visibility in AI Fabric
  • Detailed security compliance with ONES
  • In-depth analysis regarding the measurement of NWSLA

Take a ‘test drive’ with ONES Center before SONiC Deployments with our well known vendors in hardware, platforms, ASIC and OS at your ease. Make your informed decision by testing it out with our multi-vendor, including Cisco SONiC, NVIDIA SONiC, Celestica SONiC, Marvell SONiC, Wistron SONiC, Edgecore Community SONiC, Arista SONiC, Supermicro SONiC, Enterprise SONiC, and DELL SONiC.

Categories
All

Aviz ONES 2.0: Empowering SONiC Deployments with Orchestration and Visibility

We’re excited to introduce ONES 2.0, a cutting-edge network operations tool that sets new standards for Visibility, Orchestration, and Support. This release represents a significant leap in our ongoing commitment to pushing the boundaries of network management capabilities and orchestration. 

Brimming with groundbreaking features, a polished user interface, improved data center orchestration through incremental configuration, upgraded NetOps APIs for seamless integration, and robust functionalities, ONES 2.0 empowers network teams to effortlessly streamline operations, delve into deeper insights, and ensure peak performance.

Get ready to embark on a journey where innovation meets excellence, as ONES 2.0 empowers your network endeavors with unparalleled sophistication and efficiency. It’s not just a tool; it’s a game-changer, designed to elevate your network experience to unprecedented levels. Welcome to ONES 2.0 !

Figure 1: ONES 2.0 homepage
Figure 1: ONES 2.0 homepage

Unlocking Data-Driven Success: 6 Key Benefits of Enhanced Telemetry

1. Data Center Interconnect Visibility

Enhance visibility of Data Center Interconnect topologies for Layer-2 Leaf-Spine, Rack-to-Rack Connectivity using EVPN-VXLAN and MC-LAG including control plane health and configurations.

2. Rule Engine and Alerts

Experience a robust Rule Engine and Alerts system integrated with Slack messaging and Zendesk ticketing. Monitor platform metrics, health, traffic bandwidth, and more. Stay proactive with customizable alerts and automatic ticketing.

Figure 2: Rule Configuration
Figure 2: Rule Configuration
Figure 3: Rule Alerts Summary
Figure 3: Rule Alerts Summary

3. AI-Fabric Control & Data Plane Visibility

Gain visibility into RoCE (RDMA over Converged Ethernet) metrics and RoCE link visibility for improved performance monitoring and visualize traffic flows on the topology page.

Figure 4: RoCE Traffic & Configuration
Figure 4: RoCE Traffic & Configuration

4. Firmware Compliance

Stay up-to-date with detailed firmware information for each switch, covering ONIE, BMC, BIOS, FPGA, and CPLD versions.

Figure 5: Firmware Compliance
Figure 5: Firmware Compliance

5. Enhanced Supportability Functions

  • Syslog Management: Simplify troubleshooting with a programmatic click in UI to collect the logs from devices, supported directly from the topology page.
  • Console Access and Inventory Download: Seamlessly access the console of switches directly from the ONES interface, simplifying device management. Effortlessly download the inventory details of all devices for efficient asset management.

6. Network SLA: Packet Loss & Latency

ONES 2.0 incorporates exclusive back-end support for Network SLA, enabling users to monitor packet loss and latency between any two end-points, measured using ICMP or TCP.

How does NetOps Ready Orchestration enhance enterprise deployments?

Incremental Config

ONES 2.0 adds support for incremental configuration changes. You may now deploy a template to configure your fabric and progressively update VLANs/VNIs across the fabric. This agility makes your network designs more adaptable.

Improved NetOps API

The NetOps API has been improved in 2.0 to accommodate several underlay configurations, including seamless underlay and overlay configuration. This capability not only provides for a broader range of use-cases, but also handles more deployment scenarios such as L2LS, L3LS, and flexible configuration and elastic scaling of spine and leaf nodes

New Use Cases & Features

  • L3 MC-LAG
    While L2 MC-LAG was already supported, ONES 2.0 adds L3 MC-LAG. This provides redundancy, load balancing, scalability, and easier management to meet diverse networking difficulties. Improve your network’s performance and reliability across a wide range of use scenarios.
  • Layer2 Leaf-Spine (L2/L3 Mode)
    Designed to meet the escalating demands of modern data centers, the Layer2 Leaf-Spine architecture offers low-latency, high-bandwidth connectivity with redundancy and efficient traffic distribution.
  • Rack-2-Rack Deployment
    A deployment scenario tailored for fabrics designed with interconnection with leaf devices exclusively, eliminating the need for spines. This streamlined configuration suits specific network architectures. 
  • BGP Peering over LAG
    Enable higher bandwidth, load balancing, and redundancy with this configuration use case. Multiple physical links are aggregated into a logical bundle, known as a PortChannel, optimizing network performance.
  • BGP Peering in MC-LAG Environments
    Configuring BGP peering among MC-LAG peers over the PeerLink ensures seamless operations during uplink failures, while additional interface peering expands network capabilities for optimized traffic handling and heightened resilience.
  • sFlow & DHCP Relay Support
    ONES orchestration tool’s recent support for sFlow and DHCP Relay within the data center, unlock a new level of network management.
  • Console Log
    ONES 2.0 provides administrators with a centralized console plane from which they can monitor network activity for various devices.This is quite useful for tracking the progress of the operation and recording logs in case of problems.
Figure 6: Console Log
Figure 6: Console Log
  • Monitoring Configuration & Operational Status
    In our latest release, track orchestration progresses seamlessly through our intuitive GUI. Gain real-time visibility into the progress of operational validation, empowering you to monitor every stage of your network’s deployment and verification with ease.
Figure 7: ONES UI Config and Operational Status
Figure 7: ONES UI Config and Operational Status

Explore the power and versatility of ONES 2.0 for SONiC with these exciting new features and UI enhancements. Elevate your network orchestration and management to new heights!

Conclusion

In summary, ONES 2.0 represents a significant advancement in network operations and management, establishing new standards in visibility, orchestration, and support. Packed with innovative features, an enhanced user interface, and expanded capabilities, ONES 2.0 empowers network teams to streamline operations, gain profound insights, and effortlessly ensure peak performance.

What’s next in our upcoming blog series, stay tuned to know following insightful topics:

  • Rule Engine, Alerts, and Notifications
  • RoCE Traffic Visibility in AI Fabric
  • Detailed Security Compliance with ONES
  • In-depth Analysis of NWSLA Measurement

Immerse yourself in the transformative capabilities of ONES 2.0 for SONiC, and join us on a journey toward seamless network monitoring and orchestration. Unlock the ONES 2.0 experience—schedule a demo on your preferred date, and let us show you how it’s done!

Categories
All

New Product Release Announcement: Aviz ONES 2.0 – A Milestone for SONiC Deployments

ONES (Open Networking Enterprise Suite) stands as a robust network management and supportability solution explicitly designed to tackle the distinctive challenges associated with transitioning to SONiC, an open-source Networking Operating System. Our comprehensive suite serves as a multifaceted Network Orchestration, Visibility, and Assurance platform catering to the complexities of operating multi-vendor and multi-NOS Network Infrastructure.

ONES, initially conceived as a comprehensive solution tailored for the challenges of SONiC migration, has evolved significantly. With the emergence of ONES 2.0, a remarkable milestone is achieved.

Aviz ONES 2.0 release emerges as a beacon, making a significant milestone in becoming a genuine multi-vendor SONiC Deployments, operations and AI-Fabric platform.

6 Key Features that Make Aviz ONES 2.0 a Game Changer:

1. Deep Insights with Enhanced Observability:

  • Advanced Topology View: Gain insights into underlay, overlay, and RoCE traffic Visibility in AI Fabric with an advanced topology view.
  • Detailed Metrics: Access detailed Protocols, Health and Capacity metrics presented with time series graphs for enhanced visibility.
  • Comprehensive Pages: Navigate through enriched pages for Inventory, Analytics, and Software, ensuring a holistic network view.
  • Expanded Metrics: Unlock additional metrics for devices and interfaces, enhancing analytical capabilities.
  • Quick Access: Swiftly connect with devices through an SSH button in the interface Down Widget.
  • Firmware Insights: Gain deeper insights with a Firmware details widget embedded in the Software page.

2. Rule Engine & Integrations:

  • Watcher Rules and Alerts: Implement Watcher Rules for devices and interfaces, coupled with configurable metrics.
  • Collaborative Integration: Strengthen collaboration with Zendesk and Slack integrations, ensuring seamless notifications.

3. Enhanced Compliance & Device Management:

  • Efficient Device Management: Streamline device management with syslog extraction, Console access, and Non-SONiC device controls.
  • Firmware Focus: Access comprehensive firmware information directly in the Device details, enhancing inventory management.
  • Data Handling: Export or download inventory with ease, simplifying data management.

4. Orchestration:

  • YAML Configuration Templates: Simplify configuration with YAML templates supporting features like BGP, L2/L3 MC-LAG, EVPN MultiHoming, and more.
  • Config Updates: Incremental updates for L2VNI/L3VNI configurations, ensuring efficiency in managing network changes.
  • Backup and Restore: Enjoy enhanced backup and restore options via a user-friendly UI, ensuring network stability.

5. Network SLA:

  • Performance Monitoring: Ensure optimal network performance with CLI Backend support for Packet Loss and Latency metrics.
  • Endpoint Flexibility: Measure performance between any two end-points using ICMP or TCP, providing flexibility in performance monitoring.

6. Product Security:

Regular security scans, robust certificate management, user account management, RBAC implementation, LDAP integration, mutual TLS certificates.

Conclusion:

Aviz ONES 2.0 signifies a quantum leap in SONiC supportability, offering advanced monitoring, rule-based alerts, orchestrated configuration updates, and NW SLA monitoring with enhanced UI. Embrace the future of networking with Aviz ONES 2.0 – where innovation meets reliability.

Keep an eye on this space for our upcoming blog series covering these insightful topics: 

  • ONES 2.0 Release features: Complete guide
  • Rulengine, Alerts, and Notifications
  • RoCE Traffic Visibility in AI Fabric
  • Detailed security compliance with ONES
  • In-depth analysis regarding the measurement of NWSLA

Contact us for personalized guidance and tailored solutions to optimize your SONiC experience. Our team is here to assist you in navigating the complexities, answering your queries, and exploring the best SONiC solutions suited for your unique requirements.

Categories
All

Why Normalizing SONiC for Multi-vendor Network Infrastructure is Important and How to Do It?

A multi-vendor network strategy is a forward-thinking approach that enables organizations to harness the full potential of SONiC. It revolves around the adoption of SONiC as a common NOS while deploying hardware sourced from multiple vendors within the same infrastructure. One of its key advantages is the flexibility; organizations can select the best-suited hardware to meet specific requirements without being constrained by a single-vendor solution. Additionally, the strategy encourages healthy competition among vendors, leading to better pricing, feature innovation, and support options.

While a multi-vendor network strategy brings a multitude of benefits to organizations seeking consistency and efficiency in their network deployments, it also entails normalization of SONiC, its operational data, tools, and processes to ensure a unified and standardized environment is available for administrators and operators. In this blog, we will define and elaborate on why and how to normalize SONiC across diverse hardware platforms and foster interoperability to achieve greater flexibility and agility in your network infrastructure.

Why Normalize SONiC for Multi-Vendor Networks: Overcoming Challenges and Maximizing Benefits

Today, every forward looking networking team, regardless of their size, is looking to increase the capacity and performance of their network infrastructure. Everyone wants to embrace the latest advancements in ASIC functionalities, and deploy the latest hardware and software that allows them to scale and control their infrastructure on their own terms. But relying on a single vendor for hardware can lead to lock-in, where a company becomes overly dependent on that vendor’s products, services, and ecosystems.

While there are clear advantages to avoiding dependence on a single hardware vendor, it’s also important to note that managing multi-vendor deployments can bring its own set of challenges, such as increased complexity in network management, integration, and support. SONiC brings the layer that creates the balance between the benefits of vendor diversification with the operational challenges of a multi-vendor network strategy. But, even SONiC presents several challenges:

  1. Management Complexity: Vendors typically have their own versions, configurations, management tools, and interfaces for SONiC. This heterogeneity complicates infrastructure management and monitoring.
  2. Compatibility Risks: As each vendor updates their version of SONiC, there is increased risk of introducing compatibility issues with other components in the overall network.
  3. Integration Hurdles: The variance in SONiC versions from each vendor often leads to integration issues resulting from configuration/telemetry inconsistencies across platforms.
  4. Support Variability: Different vendors offer varying levels of support, both in terms of quality and responsiveness, this can make troubleshooting issues more complicated.
  5. Training Demands: Operators might need training for vendor specific hardware and software nuances, which can lead to longer onboarding times.

Hence, for consistent behavior and operability of using SONiC across different hardware platforms, normalization is essential. Normalization brings:

  1. Single Quality Standards & Consistent Visibility: To ensure that the OS provides consistent behavior and features regardless of the underlying hardware, and operational metrics are consistent across different platforms.
  2. Ease of Deployment: To ensure operators can expect similar configurations, behaviors, and deployment processes across different hardware.
  3. Standardized Troubleshooting: To ensure the environment emits consistent logs and alerts regardless of the hardware it runs on.
  4. Common Training & Documentation: To ensure network professionals have access to common and consistent resources and behaviors don’t vary based on the underlying hardware.
  5. Community Collaboration: To ensure that effective collaboration can occur between the users, the vendors, and the open-source SONiC community.

In essence, normalization of SONiC ensures that its benefits are fully realized without introducing unnecessary complexities and inconsistencies in the network operations. More importantly, normalization reinforces that the NOS layer doesn’t lean too heavily towards any particular hardware vendor’s idiosyncrasies.

Achieving SONiC Consistency: Best Practices for Multi-Vendor Networks

Achieving consistency and interoperability with SONiC across multiple hardware platforms requires a combination of best practices, rigorous testing, purpose-built tools, and unified support, together referred to as normalization of SONiC. Here are some steps, considerations, and recommendations to ensure a unified experience for multi-vendor SONiC deployments, and to help maximize the TCO savings from adopting SONiC.

  1. Comprehensive SONiC Evaluation
    Evaluating SONiC requires a comprehensive approach that considers both its technical capabilities and viability within your network infrastructure. Each organization has their own requirements for routing protocols, throughput, latency, reliability, telemetry, etc., including costs. You should start with creating a list of requirements that meets your network’s demands, and then look for options that fit your bill. Many organizations prefer POCs for such evaluations, but POCs can be expensive and time consuming, especially when they entail procurement of multiple hardware from different vendors. 

    At Aviz, we identified this problem, and created what we call the Open Networking Experience Center for SONiC (ONE Center). ONE Center is a lab environment designed to showcase the capabilities of SONiC across a wide range of hardware. It gives you hands-on experience with SONiC and an avenue for doing SONiC evaluations.

    In real-world scenarios without upfront investment into hardware. Aviz even provides SONiC Test reports specific to your evaluation criterias highlighting viable choices of Switches that may meet your needs.
  2. Rigorous Testing of SONiC Builds
    Contrary to popular belief, the community SONiC may not be the right choice for your organization. While the community version provides all the necessary baseline features, oftentimes your exact feature requirements can only be met by vendor specific distros. This is very typical of any open-source software as the overall progress of the community hinges upon the contributions from community members and vendors alike. 

    Hence, it is important to identify what version will work for you, but it is even more important to perform rigorous testing before and after deployment. We recommend testing SONiC builds for functionality, performance, and scalability across your selected platforms, specifically for the use cases pertaining to you. At Aviz, we have developed a test suite to ensure that quality standards for SONiC meet the expectations of our customers.

    Our Fabric Test Automation Suite (FTAS) is designed to verify SONiC for Data Center and Edge deployment readiness. Moreover, FTAS is a fully automated CI/CD based solution ready to plug into existing network infrastructures for pre, and post deployment testing scenarios. We have helped multiple customers in establishing a vendor-agnostic unified SONiC build pipeline that can be rigorously tested before and after deployment. In some cases, we even manage the customer GitHub repositories that host the builds normalized for our customer use cases.
  3. Unified Configuration Management
    Deploying SONiC in multi-vendor environments makes it critical to have a unified configuration management system that ensures network administrators can seamlessly deploy, modify, and maintain their configurations across multiple hardware platforms. 

    Leveraging tools like YANG models, organizations can standardize SONiC configurations and data structures, presenting a consistent interface to their teams. Furthermore, a unified approach helps in streamlining troubleshooting and auditing activities, eliminating inconsistencies that might arise from platform-specific configurations. 

    At Aviz, we have taken it one step further by developing a unified SONiC Fabric Manager, a part of our Open Networking Enterprise Suite (ONES) solution, which can help orchestrate and configure multi-vendor SONiC networks with ease. 

    ONES can create, configure various topology configurations for Leaf, Spine, and Super-spine layers, apply them, and validate them on your entire fabric automatically. It even allows you to compare running configs against your applied configs. Additionally, ONES boasts features such as ZTP that can help upgrade your switches with custom SONiC images, something that is extremely critical for keeping your SONiC versions up-to-date.
  4. Standardized SONiC APIs for Integration
    While the Switch Abstraction Interface (SAI) at the core of SONiC is designed to ensure a consistent API layer across various ASIC vendors and the NOS, unfortunately, the platform components that make up the switch, bring their own nuances, leading to inconsistencies that often introduce complexities in integrating SONiC with various NetOps tools. 
    Standardized APIs across SONiC versions play a pivotal role in ensuring streamlined NetOps integration to deliver management agility, automation, and unified analytics. Standardizing APIs across a diverse range of hardware platforms with multiple variants of SONiC running on them can be extremely tedious, and hence a deterrent to your integration requirements. This is another area where we have put in a lot of effort to deliver Standardized APIs for nearly every variant of SONiC to our customers. 

    Our ONES platform has built-in REST APIs for both Day 1 and Day 2 operations. These REST APIs can easily be integrated with your NetOps tools, eliminating the need for you to invest your time and resources into Standardizing APIs for your SONiC operations.
  5. Unified Monitoring System Across Platforms for SONiC Deployments
    As organizations move towards adopting open-source SONiC over proprietary networking solutions, the one thing that everyone seeks is a single-pane-of-glass for monitoring their multi-vendor infrastructure. Everyone expects a Cisco/Arista grade experience not only because of the ease of use they brought, but also because no one wants the added complexity of using multiple systems for each of their vendor specific components.

    In an environment where SONiC operates across various hardware vendors, the potential for disparate monitoring tools and fragmented visibility is high. A cohesive visibility across all hardware components simplifies monitoring, troubleshooting, and promotes proactive network management.

    Aviz ONES offers administrators a unified, comprehensive view of their entire SONiC network landscape. It streamlines workflows, eliminating the need to juggle multiple vendor-specific tools, maximizing operational efficiency. In essence, a single-pane-of-glass approach maximizes and ensures consistent, high-quality network performance across diverse SONiC deployments.
Open Networking Enterprise Suite (ONES) by Aviz Networks
ONES overview
  1. Common Support Channel Across Vendors (NVIDIA, Cisco, Edgecore, Marvel…)
    No one wants to deal with multiple support channels for their infrastructure. The decision to go multi-vendor SONiC brings the critical challenge of having to deal with multiple support teams, ticketing tools, processes, timelines, SLAs, and more. In the world of SONiC, where hardware from various vendors coexist, the ability to receive consistent, reliable, and comprehensive support across all vendors is of paramount importance. A unified support ensures that network administrators have a single channel to raise issues, or seek guidance regardless of who is the supplier of their hardware. 

    At Aviz, we have created that unified support channel for SONiC by working tirelessly with our customers for years and collaborating with nearly every major ASIC and Switch vendor in the industry. Our team of SONiC experts guarantees that the nuances and idiosyncrasies of each vendor’s platform is well-understood and addressed timely for our customers. 

    We are proud to have partnered with the likes of Broadcom, Celestica, Cisco, Edgecore, Marvell, Micas, Nvidia, Supermicro, Wistron, and many more to deliver the much needed unified support for the SONiC ecosystem. 

    In most cases, our global teams, available 24×7, are able to meet SLAs as low as 30 minutes, since our contracts with hardware vendors have enabled us to provide a seamless experience. Additionally, with all the experience and expertise we have amassed over the years, we have been able to automate the SONiC troubleshooting via our carefully crafted runbooks that are normalized across multiple platforms. 

Conclusion

As organizations embrace SONiC for their multi-vendor networks, achieving consistency becomes paramount. To ensure seamless migration to, and interoperability with SONiC:

  1. Conduct a comprehensive end-to-end evaluation of hardware options tailored to your needs
  2. Choose the right SONiC distributions, and ensure rigorous testing is performed
  3. Utilize a unified configuration management system to streamline deployments
  4. Utilize standardized APIs to facilitate efficient NetOps integrations
  5. Utilize unified monitoring solutions for multi-vendor SONiC management
  6. Identify a centralized support partner that can collaborate with all hardware vendors

Adopting these strategies will maximize the benefits of multi-vendor SONiC deployments.

Contact us today, how we can support your networking needs

Categories
All FTAS

FTAS 2.0: Enabling Customer Driven Quality Standards for Multi-Vendor SONiC Deployments

The Importance of Quality Standardization in Community SONiC Based Networks

While traditional network deployments rely on proprietary solutions from switch vendors for quality assurance, the shift towards community SONiC presents a challenge: maintaining high-quality standards without turning SONiC into a proprietary system. The solution? FTAS (Fabric Test Automation Suite).

FTAS empowers customers to manage the diversity of community SONiC with a simple, one-click quality check. It enables data centers to transition to Community SONiC confidently, ensuring that any vendor’s switch aligns perfectly with their specific requirements through standardized and automated FATS test cases.

The evolution of FTAS from its initial focus on a few vendors and use cases to the more comprehensive FTAS 2.0 highlights our commitment to expanding support and adapting to a wider range of customer needs. This advancement underscores our dedication to fostering standardized quality assurance in the Community SONiC ecosystem.

FTAS (Fabric Test Automation Suite) Overview

Overview of Aviz Networks’ Fabric Test Automation Suite (FTAS)

Four-device topology for testing scenarios in FTAS

The Fabric Test Automation Suite by Aviz Networks is a groundbreaking testing solution, combining the consistency of SONiC’s CLI with advanced testing methodologies. It represents a significant advancement in switch testing, catering to the complex demands of modern network fabrics with a commitment to quality and performance. Following are a few innovations we have done with FTAS.

  • Comprehensive Testing: Using SONiC CLI: FTAS leverages the uniformity of SONiC’s Command Line Interface across various vendors, enabling extensive and thorough testing of switch functionalities. This ensures a robust validation of core switch operations within complex network infrastructures.
  • Integration with Jenkins for CI/CD Testing: The suite is compatible with Jenkins, facilitating a ready-to-use Continuous Integration/Continuous Deployment (CI/CD) pipeline tailored for Network Development Operations (NetDevOps). It supports testing on both physical switches and GNS3-based virtual appliances, enhancing its versatility as a true virtual system.
  • In-depth Fabric Analysis: FTAS is specifically designed for detailed analysis of switch behavior within network fabrics. It uses SONiC’s consistent CLI to explore and understand intricate fabric interactions, highlighting the suite’s capability for detailed network scrutiny.
  • Tailored for Modern Network Needs: The suite is developed with a focus on the evolving requirements of contemporary networks. It offers a flexible and adaptable toolset, poised to address the dynamic nature of modern network technology landscapes.
  • Focus on Performance, Reliability, and Scalability: Aviz Networks has a clear mission with FTAS – to enhance network performance, reliability, and scalability through rigorous and comprehensive testing. This focus ensures networks are not just meeting but exceeding performance expectations.
  • Commitment to On-going Excellence: FTAS embodies Aviz Networks’ dedication to providing high-quality testing solutions. It ensures optimal network performance and resilience, showcasing the company’s commitment to excellence in network testing.

Optimizing Test Cases and Quality Quest Across the Entire SONiC Ecosystem with Major Vendors

Open Networking Experience (ONE Center): SONiC evaluation sandbox for diverse hardware.

Aviz Networks has always taken an inclusive and comprehensive approach working with all of our partners. Community comes with diversity and with that it comes with a lot of heavy lifting which is required to do normalization for complex test cases. Below is the high level process we used to achieve a standardized test suite which can now work on all the vendors registered in ONE center, and hence available for customers to to POC or setup a CI/CD lab of their own and start on NetDevOps journey.

  • Selective Test Case Automation: The team at Aviz Networks embarked on an automation journey, carefully selecting key test cases for automation. This selection process was thorough, ensuring that the chosen test cases were crucial for the automation objectives – typically selected by high value customer use cases.
  • Rigorous Cross-Platform Testing: The automated test cases were subjected to intensive testing across various platforms available in the Aviz lab. This included a range of platforms from industry leaders such as Cisco, NVIDIA, Edgecore, Wistron, and others, ensuring a comprehensive testing process.
  • Platform Independent (PI) Test Suite: A significant number of these test cases were identified as platform-independent. They demonstrated consistent behavior across all tested platforms. These were grouped into the Platform Independent suite, making them versatile for execution on any platform within the ecosystem.
  • Identification of Platform Dependent (PD) Test Cases: Some test cases showed variations in behavior across different platforms. This was due to differences in the outputs of show commands or the support of different feature sets by various switches. These test cases were categorized as Platform Dependent, acknowledging their specific applicability to certain platforms.

In summary, the optimization of test cases across the SONiC ecosystem at Aviz Networks involved a strategic selection and categorization of test cases, ensuring effective automation across a wide range of platforms. This process led to the creation of both Platform Independent and Platform Dependent test suites, accommodating the diverse nature of the testing environment and the specific characteristics of different platforms.

Customer Guide to Accessing and Using FTAS

FTAS serves a dual role, catering to both switch vendors and end customers with its versatile applications. It’s a dynamic tool, continuously evolving to incorporate the latest in SONiC community developments.

For Switch Vendors

  • R&D and Product Development: Utilized in the research and development phase, FTAS aids in crafting high-quality SONiC solutions.

For End Customers

  • Proof of Concept (POC) Testing: As a subscription service, it enables end customers to conduct POCs. This facilitates testing of multi-vendor SONiC solutions, helping in informed decision-making regarding vendor selection.
  • CI/CD Lab Management: FTAS is also instrumental in managing Continuous Integration/Continuous Deployment (CI/CD) labs, streamlining NetDevOps processes for efficient SONiC deployment.

Continuous Enhancement

FTAS is not static; it’s a ‘living entity’ that regularly integrates new use cases, platforms, and updates from the SONiC community. This ensures users always have access to the most advanced and comprehensive tools available.

If you are looking for a comprehensive and easy-to-use testing solution for your SONiC network, then FTAS is the perfect option for you. For more detailed information about the supported suites and their configurations, visit: FTAS Test Suite Configuration


FTAS Reporting Mechanism: Delivering Comprehensive Test Insights

Test Execution report- user-friendly HTML format with detailed test results and logs for transparent and efficient testing experience
Overview of the FTAS Test Results – Execution Report

The Next Level of Testing: Introducing FTAS 2.0 and its Exciting Features

Working with customers and partners, we focused on below three items in FTAS release 2.0: 

  • Making it more robust for the existing use cases and adding new test cases as per new customers
  • Adding more platforms for higher speeds and feeds and also new ASICs/Platforms
  • Making the test cases more flexible so custom test cases can be executed

Detailed list for 2.0 release can be found here

Conclusion

FTAS is an invaluable asset for corporations exploring SONiC or switch vendors seeking enhanced support. It revolutionizes testing by offering comprehensive coverage with reduced effort and expense. By eliminating the need for extensive teams, complex test planning, and manual execution, FTAS streamlines the testing process. It ensures accurate, efficient outcomes at a fraction of the traditional cost and effort.

Ready to explore? Book a demo with us on FTAS, and how it works?

Categories
All

The Old NOS is Dead! Long live the SONiC King!

What happened to the trailblazers in SDN?

The software-defined networking (SDN) craze began with the promise of a disaggregated Network Operating System (NOS). The ability to have hardware and silicon diversity, with carrier-class network performance and resiliency, took flight.

Vyatta, Pica8, Pluribus, and Cumulus are some of the companies that emerged when SDN was new, and OpenFlow and OpenStack were all the rage.

SDN attempts to reshape the Intersection between NetOPs and DevOPs

Programmatic capabilities and CLIs driving network transformation towards 'as code' approach

Programmatic capabilities and command-line interfaces (CLIs) are becoming increasingly popular. The network and its infrastructure are being influenced by the DevOps movement and are transitioning to an “as code” approach.

As infrastructure starts to virtualize, DevOps teams are building overlays for their own rapid service expansion on top of the existing network. This has led to the creation of VXLAN. The worlds of NetOps and DevOps have collided as a result of this trend.

The convergence of NetOps and DevOps occurs due to widespread infrastructure virtualization, giving rise to VXLAN

Ring a bell?

Now, let’s talk about the fate of those NOS players…

  • Vyatta – Acquired.
  • Pica8 – Receivership. Assets Acquired. Best of luck.
  • Pluribus – Acquired. Your sole hardware choice.
  • Cumulus – Acquired. Broadcom support discontinued. A familiar tale, just like Pluribus.

Shifting Tides in the NOS Space: The Evolution and Future of SONiC with Microsoft and the Linux Foundation

SONiC, alternatively known as Azure Software for Open Networking in the Cloud (SONiC), a cross between NetOps and DevOps, SONiC emerged unexpectedly from the heart of Microsoft, a source that might have been the least anticipated.

Thank you to the Microsoft Azure team! 🙂

Microsoft has handed over the development of SONiC to the Linux Foundation and Open Compute Platform (OCP). This move ensures that SONiC will remain open-source and will not be acquired or end-of-lifed. Ever. 

The Linux Foundation will focus on the software component of SONiC and will work with OCP to align hardware and specifications like SAI.

SONiC’s Rise to Dominance: Navigating the Competitive Landscape of Disaggregated NOS in the Network Market

Initially, there were some reservations about SONiC; however, it has since become a dominant force powering many platforms, including Azure, Google, and META, just to name a few. At this point, SONiC momentum is unstoppable!

It seems like the disaggregated Network Operating System (NOS) market has become highly competitive, with some NOS struggling to survive. The market is similar to a game of “NOS Thrones,” where only the most capable NOS will survive and claim market share dominance. This competition is not limited to data centers but also extends to hyperscalers and the edge (both access and the RAN/ORAN Cell Site).

How to address the rapidly changing and disappearing NOS space with a long term solution?

According to a report by 650 Group, the worldwide market for SONiC in data center switching is expected to exceed $4 billion in revenue by 2025. The report also suggests that SONiC’s role outside of hyperscalers will increase during the forecast period.

The big technology conglomerate like Cisco has adopted SONiC on their 8000 series hardware and has partnered with Aviz Networks to support it for their customers. It is predicted that SONiC will have a market share of 40% by 2025, which is second only to Cisco.

So the question is not “What do I do?”, but as Nick Fury would say, “What are you prepared to do?”! Let’s discuss the most effective solution to the Cumulus Broadcom problem and other disappearing NOS.

You can either choose:

a) Another NOS that can perform the required tasks with some degree of uncertainty.

or

b) Opt for a more reliable alternative that includes Management, Automation, Orchestration, AND Observability, along with Support for the NOS in one package that can go the distance, such as Aviz with SONiC.

From Setbacks to Success: Embracing Aviz Networks + SONiC for Trustworthy NOS Solutions

Hear ye, Hear ye, The old NOS are Dead! Long Live The SONiC King!

The new SONiC King replaces outdated NOS

I invite you to join me for a unique blog series. 

You may recall my previous episodes in the Pica8 NOS land, where the receivership left customers in the lurch for months. I lived through it, and that Pica8 sting is still fresh. Similarly, Cumulus left a sting that still burns.

Let’s turn that burn into a raging success! How do you regain trust after a bad situation? The same way you always do. Get on top of a better solution! The better, future-proof solution is Aviz Networks + SONiC.

Aviz Networks and SONiC collaboration for network solutions

I’ve gained invaluable insights, and from my experience, I can attest it’s never too late to make the right choice.

Join me in this Blog Series and my exciting journey into Digital Transformation with SONiC at Aviz Networks.

Discover the superpowers you never knew you had, simplifying and rapidly integrating SONiC into your ecosystem, whether you’re dealing with HyperScalers or others!

Let me take you on the adventure that will supercharge your digital transformation into a performance-driven, budget-saving, AI-powered machine, all on a smaller budget. We will show you how to do more with less. Stay tuned for Episode 2! 😊

Categories
All

Fortifying ONES (Multi-Vendor SONiC Stack) – The Enterprise-Grade Security to Safeguard Data Center Fabrics

In today’s fast-paced digital landscape, safeguarding your enterprise is paramount. With cyber threats constantly evolving, having a robust security strategy is non-negotiable.

Securing Your Enterprise with ONES and SONiC (Software for Open Networking in the Cloud): This Comprehensive Guide Talks About

  • Focus on Enterprise Product Security: Exploring essential aspects of securing enterprise products
  • Fortifying ONES: Detailing how we’ve strengthened ONES for enterprise SONiC customers
  • Pivotal Security Elements: Highlighting crucial security components like security scans, Certificate Authorities (CAs), user account management, Role-Based Access Control (RBAC), LDAP, and Mutual TLS (Transport Layer Security)

Fortifying Your Enterprise: 8 Essential Enterprise Security Practices

  • Regular Security Scans: Perform frequent security scans to identify vulnerabilities and weaknesses
  • Robust Certificate Management: Establish a reliable CA infrastructure to ensure trust in digital certificates
  • User Account Hygiene: Enforce strong password policies, implement MFA, and monitor user accounts for suspicious activity
  • RBAC Implementation: Assign roles and access permissions based on job responsibilities, and regularly review and update them
  • LDAP Integration: Centralize user and resource management with LDAP to improve security and network efficiency
  • Implement Mutual TLS: Secure communication between systems and services with mutual TLS for enhanced data protection
  • Streaming Telemetry and Continuous Monitoring: Start with collecting data from various sources such as logs, network traffic, and endpoint devices. Advanced analytics and machine learning are employed to identify anomalous behavior and potential security incidents
  • Security Patches: Must-have tools in the ongoing battle against cyber threats. They are updates released by software vendors to address known vulnerabilities and weaknesses in their products

Aviz Networks commences its journey with customers right from the pre-deployment stages. Our dedicated customer success teams collaborate closely with enterprise security and audit teams to align their strategies and processes with security objectives. 

To learn more about our successful partnership with SONiC, we invite you to explore our case study: “Maximizing Success with SONiC.” Discover firsthand how Aviz Networks delivers reliable and secure solutions to empower your network infrastructure.

Let’s understand how we support multi-vendor SONiC deployments without compromising on the enterprise security requirements.

Revolutionize Your Networking with ONES: The Open Networking Enterprise Suite

ONES is a network orchestration, visibility, and assurance solution for multi-vendor and multi-NOS operated network infrastructure. It provides a one-stop solution, right from delivering deep network visibility into your data center networks to extending 24×7 SONiC support. This solution also hosts a powerful analytics engine that assists users in identifying network issues and troubleshooting their networks, in case of common network anomalies and disruptions.

We focused on network security as the primary tenet while building ONES to cater to our enterprise SONiC customers and ensured the product adhered to all the best practices mentioned above. This blog highlights how the best practices are implemented in ONES.

Open Networking Enterprise Suite (ONES) - Simplify your network management.
ONES Overview

Streamlining Security Measures with Automated Scans

While customers perform security scans on software images, nonetheless, we have integrated and automated security scans within the CICD pipeline to ensure the integrity of software packages. 

Aviz runs security checks, installer scan, SAST/ DAST (Static/Dynamic Application Security Testing) using SynK, SonarQube, etc. to ensure the robustness of the ONES application and identify any vulnerabilities against malicious attacks and potential security risks. 

We adopt a CICD framework that integrates security into all phases of the software development lifecycle to reduce the risk of releasing code with security vulnerabilities.

Vulnerability Patching: Crucial updates to strengthen system security and counter cyber threats.
CICD Framework

Ensuring Secure Communication with HTTPS CA Certs

ONES strongly enforces HTTPS over standard port 443 coupled with certificates signed by trusted Certificate Authority (CA). We firmly believe that HTTPS with CA certs is the sole method of safeguarding sensitive information and privacy while the data transfers between systems and services in an enterprise environment.

Setting Up User Accounts and Role-Based Access Control

ONES is designed in such a way that every user has an independent ONES account and is never required to share credentials with others. However, we have also created a ‘super admin’ account that can be used for troubleshooting and recovery in case of any individual account issues, for example – a locked account or forgotten password, etc.

Troubleshoot and recover with ONES independent user accounts and super admin.
Account Management – User accounts

In addition to user accounts, ONES provides a fine-grained RBAC to restrict access to special features. It ensures that the individuals have the appropriate level of access based on their roles and responsibilities within the organization.

Ex: Critical Switch operations like reboot, ZTP can be allowed for Vendor staff.

  • Super admin
  • Enterprise Admin
  • Enterprise Staff
  • Vendor Staff
ONES provides fine-grained RBAC for user accounts, limiting access to features based on organizational roles and responsibilities.
Account Management – Roles and Permissions

Benefits of LDAP for Centralized User Authentication

LDAP simplifies user authentication and directory services in enterprise environments. It centralizes user account information, making it easier to manage access and permissions. Integrating LDAP into your security strategy enhances user management and access control while promoting scalability and efficiency. ONES application extends integrations with customer identity management solutions such as Active Directory and uses LDAP to communicate with Active Directory to authenticate users.

What is Mutual TLS and How Does it Ensure Secure Communication?

ONES is designed to support Mutual TLS (Transport Layer Security), or mTLS, which is a security mechanism that ensures both parties in a communication exchange can trust each other’s identity. It’s particularly valuable for securing data transfer between systems and services in an enterprise environment. ONES utilizes gRPC infrastructure to communicate with switch agents. TLS is the primary security protocol used by gRPC to secure the communication between the client and the server. TLS provides authentication, confidentiality, and integrity of data. Authentication is achieved using digital certificates which verify the identity of the client and the server.

Continuous Compliance Monitoring with ONES: Real-time Metrics and Alert Capabilities

ONES enables streaming telemetry and continuously collects metrics for software compliance such as software versions (NOS, Kernel, and ONIE software versions), EOL (End of Life) licenses, and security vulnerabilities. Also, ONES enables policies and alert capabilities to ensure that organizations remain compliant with regulatory requirements and security policies. It provides a real-time view of compliance status and helps in identifying and remedying compliance issues promptly.

Continuous Monitoring for Compliance with Ones: Streamlined telemetry and ongoing metric collection for software compliance.
Dashboard – Software Compliance

What Are the Benefits of Vulnerability Patching?

Security patches are essential tools in the ongoing battle against cyber threats. They are updates released by software vendors to address known vulnerabilities and weaknesses in their products. These patches are designed to bolster the security of your systems, close potential entry points for attackers, and mitigate the risk of exploitation. ONES is built using cloud-native and microservice design principles. Therefore, it allows container upgrades without impacting the data path or application downtime. It also allows updating security fixes or vulnerability patches without upgrading the whole system. Moreover, ONES continuously monitors for security vulnerabilities and leverages the CICD to timely update the patches to the system.

How to Secure API Endpoints with ONES?

Securing an API with an enterprise product involves a combination of strategies, tools, and best practices. ONES implements user authentication using API tokens or JWT to ensure that only authorized users and applications can access the API. ONES is containerized and all the services are hosted behind an API gateway to rate limit API to endpoints.

Conclusion: Comprehensive Approach to Enterprise Product Security

In an era of evolving cyber threats, fortifying your enterprise is not just a choice – it’s a necessity. By adopting a comprehensive approach to security, leveraging essential practices, and implementing cutting-edge technologies like ONES and SONiC, you can establish a robust defense against potential vulnerabilities.

Key Takeaways:

  • Regular security scans, robust certificate management, user account management, RBAC implementation, LDAP integration, and Mutual TLS are fundamental security practices that form the bedrock of a secure enterprise environment.
  • Implementing these practices ensures trust, integrity, and confidentiality in data transfer and access control.
  • At Aviz Networks, we’re dedicated to support you from pre-deployment to post-deployment, ensuring alignment of strategies with your security goals.

Security Assurance:

Prioritizing security not only shields your organization but also instills trust in your customers and partners. They can rely on you to safeguard their sensitive information and maintain the integrity of your products and services. Our products adhere to best practices during the commissioning of sandbox and production deployments.

Interested in experiencing the power of ONES firsthand? We invite you to request a ONES demo. Our team is ready to connect with you and your team, providing insights and solutions tailored to your specific security requirements.

Stay Vigilant:

Remember, security is an ongoing process. Stay vigilant, regularly update your security measures, and adapt to emerging threats to ensure the ongoing safety of your enterprise.

Categories
All

Why Gartner Preferred SONiC Over Open Networking in 2023 Hype Cycle: 5 Point Analysis by Aviz Networks!

Gartner just published their Enterprise Networking Hype Cycle for 2023. The first thing I noticed was that Gartner had replaced “Open Networking” with “SONiC” in its 2023 chart. In fact, the report firmly stated “Open networking has been replaced on the Hype Cycle with SONiC, which garners the most client interest of any open networking technology.”   

Gartner not only swapped it, but pulled the timeline from 5-10 years to 2-5 years. This in turn shows SONiC’s acceleration that open networking in general was never able to achieve.

Aviz Networks, being a community SONiC leader in the enterprise market, felt that it was time that Gartner sees networking as we see it. We had already predicted this shift and put our thoughts here: “open vs open source networking”.

In this blog, I will go over our five-point analysis based on our insights on why this shift has happened and the path forward.

Gartner Enterprise Networking Hype Cycle report comparison 2022

Gartner Enterprise Networking Hype Cycle report comparison 2023
Gartner Enterprise Networking Hype Cycle report comparison 2022 vs. 2023

Aviz Networks’ 5 Point Analysis

1. Open networking has become a ​​cliché 

A true open networking model promised that the networks can be open for choosing any hardware or software. The promise was to bring down the TCO (by CapEx and OpEx), increase the choices by removing lock-ins (on hardware and software), and standardize the operating system layer (by enabling standard APIs). 

In the past decade, many open networking companies (such as Cumulus) started with a portion of open networking vision where they focused on disaggregating the hardware and software while keeping the software proprietary. These companies called themselves as open networking companies, but essentially they were disaggregated NOS companies. 

While these Network Operating System (NOS) companies proved that the disaggregated model works, they failed to deliver on the entire vision of open networking because it was simply a disaggregation model, which provided partial promises of open networking.

The eventual sale of most of these NOS companies to specific vendors made it clear that while disaggregated NOS works, it is not open enough – it is not future-proof!

Network Operating System (NOS) landscape and consolidations in the past decade
Network Operating System (NOS) landscape and consolidations in the past decade

2. Incumbent networking vendors and public cloud competed with open networking

Open networking by definition should have been open for the entire networking ecosystem. Big incumbents who owned more than 90% share of the networking market were not a part of open networking in the past decade, as open networking meaning got reduced to just having a disaggregated proprietary NOS.

Public cloud adoption was another big trend – which was completely missed by open networking vendors. It created a void in the open networking value proposition. Only multi-cloud/hybrid-cloud startups or existing incumbent vendors were able to solve the integration problem via their end-to-end vertical integrated stack.

Open networking vendors (typically start-ups) were not able to survive long enough competing with incumbent giants. These startups were typically focused on the battle of “my BGP is better”. This wrong kind of competition resulted in spending energy on reinventing the wheel instead of real innovation and coming up with a solution that customers needed (open source based control, choices on hardware and software, public cloud enablement, and finally the TCO savings generated from both CapEx and OpEx).

3. Disaggregated NOS (under open networking label) came with a lock-in 

Disaggregated NOS companies often used the term open networking which gave customers an impression that the NOS was open source and there was no lock-in. However, that was not true.

Customers who started with their disaggregated NOS journey later figured out that while some components of the NOS were built on open source, the code itself was not open source. Further, they were slowly getting locked-in to a vertical stack that was not as mature as incumbent vendors—all of that while paying 15-20% less than proven systems from incumbent vendors.

Customers eventually also figured out that if the operating system is owned by a company that’s the worst lock-in you can get into. Time after time it’s been proven that the competition between ASIC vendors has brutally hurt the end customers. Cumulus and Bigswitch are good examples of that. While the companies consolidated, the customers lost on open networking promises and an open network became a locked network.

4. Customers got CapEx savings but OpEx spend did not change

TCO savings was one of the important selling points of disaggregation (under the umbrella of open networking) that came mainly from choosing the cheapest hardware. Customers liked the savings part and the element of choice because it gave them a way to lower their CapEx by reducing the hardware price. 

The incumbent vendors (Industry mainstream vendors) addressed this by lowering the price of their hardware. The customers with huge deployments from incumbent vendors simply used quotes from “open networking” vendors to bring down the prices of hardware. Though the incumbent vendors did not like it, they still did it—as they pushed the margins towards OpEx.

Open networking vendors did not have an OpEx savings story as the disaggregated NOS price and support were essentially OpEx. This approach of doing CapEx-based TCO savings only helped in the beginning but it did not survive for long as the OpEx piece was still not solved.

5. SONiC completes the open networking vision at the right place and time with proven examples

Microsoft had figured out the issue with the disaggregation model way back and worked on a true open networking solution, which they named SONiC. That is open networking done the right way. By the time the disaggregated NOS companies started to consolidate, SONiC was getting steam in the 2018-2019 time frame.

It had all the elements of open networking. It was open sourced and had a thriving community. It was also well supported by all networking vendors and its quality improved to deployment grade. This provided the true benefits to the customers which open networking promised. Here’s a quick summary of SONiC’s advantages:

  • Choice: SONiC ecosystem enables any white box or incumbent vendor switches for customers.
  • Control: This ground-breaking NOS does not belong to anyone, so there will never be a lock-in. Customers now can standardize their NetOps on top.
  • Savings: SONiC not only enables customers to save on CapEx, but also on OpEx (as NOS is free and customers pay only for support).

Gartner’s shifts – aligned with what customers are looking for. The path forward!

In my conversations with customers, before I start talking about SONiC, I often ask – what are the goals they are trying to achieve? Typically every networking team gives us some of these inputs on what they are looking for:

  • Latest speeds & feeds and ASIC functionalities for next-generation workloads
  • Future-proof networks with zero dependency on a single vendor
  • Minimize budget spikes with every refresh (save on TCO in a consistent and predictable manner)
  • Network open for in-house control and innovation
  • Standardized NOS API layer for any NOS (proprietary or open source) with minimal spend on NetOps

While Microsoft and a few other F100 companies have demonstrated that these problems can be solved with SONiC, enterprise customers have questions about how they can do it exactly like them. Many times, we hear customers asking the following questions and looking for an objective answer or a process that is consistent and predictable:

  • How can a customer get incumbent vendors like experience with SONiC?
  • How can SONiC be normalized for consistent behavior across multiple vendors, while still being 100% open source?
  • How to calculate precise TCO savings over the next few years in the multi-vendor environment (multi-vendor = any hardware and any NOS)?
  • How to enable in-house innovation without hiring multiple engineering experts in-house? 
  • How to get enterprise-grade support irrespective of a switch or ASIC vendor?
  • How to integrate with other applications (be it in private or in public cloud)?

We, at Aviz Networks, are working with our partners, community and customers to accelerate SONiC adoption. Our team has developed the processes, commercial ecosystem and an entire end to end open networking stack that is truly open for any ASIC, any switch, any NOS and any cloud, while future proofing it for SONiC as the foundation.

If you have already started or are thinking about exploring the SONiC path, you can schedule a demo or contact us for a free strategy review session. Being at the forefront of this revolution, I will be happy to host a free demo/call with you and help you maximize your SONiC success while you introduce it in the mix of any other NOS in your existing environments.

Categories
All

Power up your NetOps Support Framework for SREs with ONES Innovation

In today’s interconnected world, Network Operations (NetOps) Support Framework is crucial for organizations to maintain a robust and reliable network infrastructure. It provides the foundation to manage and optimize network performance, ensure seamless connectivity, and address other related issues. In this post, we bring you an overview of NetOps Support Frameworks, their key components, and significance in maintaining efficient operations. We also talk about SLAs and their benefits in NetOps Support Framework.

Components of NetOps Support Frameworks

Let’s quickly glance through a few critical components.

1. Network Monitoring and Management

This component covers:

  • Real-time monitoring of network devices and traffic
  • Performance analysis and reporting
  • Configuration management and compliance
  • Network inventory and asset management

The next-generation management tools offer extensions for supporting advanced functions that include:

  • Network Orchestration
  • Streaming Telemetry

Network Orchestration and Telemetry Streaming work together to enable the automation, control, and visibility of network operations while leveraging real-time telemetry data for enhanced network management and analysis. Let’s understand these functions in detail.

Network Orchestration

This function represents the overall system responsible for orchestrating and automating network operations, including configuration management, service provisioning, and network policies.
It includes a core component, Orchestration Engine, that receives high-level commands/policies and further, translates them into actionable tasks for Network devices. A network device is a physical or virtual one that makes up the network infrastructure such as a router, switch, firewall, or load balancer.

Telemetry Streaming

This function represents the process of collecting, aggregating, and forwarding real-time network telemetry data to various telemetry consumers for analysis and decision-making purposes.
Here, Telemetry Collector acts as an intermediary component responsible for collecting telemetry data from network devices, leveraging protocols like gRPC, NETCONF, or SNMP. Telemetry Consumers refer to the applications, systems, or analytics platforms that consume and analyze network telemetry data. These consumers can include network monitoring tools, data analytics platforms, and machine learning systems.

2. Fault Management and Troubleshooting

This component includes: 

  • Rapid detection and isolation of network issues
  • Root cause analysis and remediation
  • Incident management and escalation processes

3. Change Management and Configuration

content-managment

1. Control and coordination of network changes
2. Version control and documentation
3. Change approval processes and tracking


4. Performance Optimization

Performance-Optimization

1. Capacity planning and bandwidth management
2. Quality of Service (QoS) implementation
3. Traffic engineering and optimization
4. Proactive network optimization strategies


5. Security and Compliance

Security-and-Compliance

1. Network security monitoring and threat detection
2. Firewall management and access control
3. Compliance with industry regulations (for example PCI-DSS, GDPR)
4. Vulnerability assessment and patch management

Supporting Multi-Vendor NOS and Switch Hardware

In today’s diverse networking landscape, organizations often rely on a mix of network operating systems (NOS) and vendors to meet their specific requirements. However, managing and supporting multi-vendor NOS environments poses unique challenges that can be streamlined with specialized NetOps Support Frameworks. Multi-vendor NOS integration in NetOps Support Frameworks requires an understanding of interoperability challenges and the need for standardized management frameworks. For a seamless multi-vendor NOS support, vendor-agnostic network monitoring and management are primarily needed for: 

1. Consolidated monitoring of dashboards for heterogeneous network devices
2. Integration with various NOS APIs for unified device management
3. Leveraging standardized protocols (for example SNMP, NETCONF, RESTful APIs) for device communication
4. Managing and troubleshooting cross-vendor faults:
    a.  Correlation of alerts and events from different NOS vendors
     b.  Centralized incident management and ticketing system
     c.  Collaboration with vendor support teams for issue resolution
5. Change management and configuration:
     a.  Standardized configuration templates for different NOS vendors
     b.  Integration with configuration management databases (CMDB)
     c.  Change tracking and rollback mechanisms for multi-vendor environments
6. Performance optimization and traffic engineering:
     a.  Bandwidth allocation and optimization across diverse NOS platforms
     b.  QoS implementation for consistent performance across vendors
     c.  Traffic engineering strategies for load balancing and optimization

Importance of Service Level Agreements (SLAs)

In network infrastructure support, SLAs define the agreed-upon expectations/responsibilities between service providers, like Aviz Networks, and their customers. These SLAs outline key performance indicators such as service availability, response times, and other parameters.

Therefore, these play a vital role in ensuring that the network meets desired service levels and provides a satisfactory user experience. Let’s deep dive into more details: 

  1. KPIs: SLAs outline multiple KPIs such as network availability, packet loss, latency, throughput, and response times. By benchmarking the metrics, SLAs provide a quantifiable means for evaluating the performance of network infrastructure as well as service provider. 
  2. Network Availability: SLAs specify the expected level of network availability, typically expressed as a percentage of uptime over a given period. This metric indicates how often the network should be operational and accessible to users. It also ensures the accountability of a network service provider for maintaining a reliable and continuously available network infrastructure.
  3. Response and Resolution Times: SLAs often include response and resolution time commitments for network incidents or service requests. The response time defines how quickly the service provider should acknowledge and respond to reported issues. The resolution time sets expectations about the time required to restore the network service to its normal functioning state.
  4. Downtime and Maintenance Windows: Another benefit of such agreements is the provision for scheduled maintenance windows during which network services may be unavailable temporarily. By establishing a clear schedule and notifying customers in advance, SLAs help manage expectations and minimize service disruptions.
  5. Escalation Procedures: SLAs outline escalation procedures to follow in case of critical incidents or service disruptions. This ensures that prompt actions are taken to address the issue and involve higher-level support or management, if necessary.
  6. Remedies and Compensation: SLAs include provisions for remedies in the form of service credits, discounts, or other types of compensation to mitigate the impact of service disruptions/failures caused by the service providers.
  7. Reporting and Review: Lastly, these agreements usually include reporting mechanisms to track and communicate network performance against the agreed-upon metrics. Regular performance reports and service reviews enable both parties to assess the network’s performance, identify areas for improvement, and ensure transparency and accountability.

Benefits of SLAs in NetOps Support

  1. Improved Operational Efficiency:
    a.  Streamlined management processes for diverse NOS platforms
    b.  Reduced complexity and overhead associated with managing multiple vendors
    c.  Centralized visibility and control over the entire network infrastructure         
  2. Enhanced Network Resilience and Performance:
    a.  Rapid fault detection and resolution across different NOS environments
    b.  Optimal utilization of network resources through unified performance optimization strategies
    c.  Consistent security measures and compliance enforcement across vendors               
  3. Customer Satisfaction and Business Continuity:
    a.  Adherence to SLAs for ensuring service reliability and customer satisfaction
    b.  Minimized downtime and faster incident resolution through SLA-driven support processes
    c.  Risk mitigation associated with multi-vendor environments         

Introducing ONES (Open Networking Enterprise Suite)

ONES from Aviz Networks is a network observability/visibility, orchestration, and assurance solution for network switches running SONiC and vendor-proprietary NOS (Network Operating System).

ONES provides a one-stop solution, right from providing better visibility into your data center networks to extending 24×7 support function for SONiC. It also hosts a powerful analytics engine that provides Proactive, Predictive, and Prescriptive Analysis of common network anomalies and disruptions.

The key capabilities of ONES include:

  1. Purpose-built solution for SONiC deployments
  2. Supports multiple NOS for comprehensive visibility
  3. Orchestration and deep telemetry for observability
  4. 24×7 enterprise-grade support options for SONiC

ONES – Value and Beyond

MONITOR

Monitor your entire multi-NOS fabric

Manage inventory of your network devices running any Network OS on Broadcom, Marvell, Nvidia, and other leading ASICs
View topology of the entire fabric across multiple hardware platforms, and network operating systems
Monitor traffic, system health, bandwidth utilization, and more between and across devices

ORCHESTRATE

Configure your SONiC fabric with ease

Create and configure CLOS topology for ToR, Leaf, Spine, and Super-spine layers
Apply and validate configurations pre- and post-deployment
Compare running configs against applied configs at any point
Upgrade devices with a single-click via ZTP or custom NOS Images

SUPPORTABILITY

NetOps Simplified

Proactively track Switch CPU/memory consumption, bandwidth, link failures, traffic errors, and more
Instantly connect to individual devices for maintenance and quick troubleshooting
Collaborate across your teams and with our SONiC experts to solve issues more efficiently

Traditional Network Orchestration tools have evolved from just delivering and monitoring network functions for proprietary NOS to designing and building network fabrics in an automated and intent-based approach.

ONES takes the Orchestration journey to the next level—adding capabilities from SONiC NOS across a fleet of multi-vendor and multi-ASIC switches, bringing together capabilities of streaming telemetry, API programmability, network control, intent-based fabric configuration, and SLA assurance for supportability.

Predictive failure/health analytics and capacity planning enable Orchestration tools (like ONES) to provide a seamless adoption journey for SONiC by leveraging historical trends of resource utilization, traffic patterns, logs/events, and derived application/workload performance.

Supportability, a crucial feature of Network Orchestration tools, goes beyond just notifying and alerting. It also enables integration with IT tools/engine to check anomalies or events correlation using real-time or historical data, single-touch management, and in turn, simplify switch/fabric onboarding for scale.

With the rapid adoption of open-source SONiC, ONES has emerged as a one-stop solution for network infrastructure teams. It seamlessly enables orchestration, deep telemetry, and assurance for multi-vendor deployments. Most importantly, the 24×7 SRE support enables them to introduce SONiC in their networks with utmost confidence.

Author:
Arakkal Kunju Mohammed Yasser, Director of Engineering, Site Reliability Engineering

Power up your NetOps Support Framework for SREs with ONES Innovation

In today’s interconnected world, Network Operations (NetOps) Support Framework is crucial for organizations to maintain a robust and reliable network infrastructure. It provides the foundation to manage and optimize network performance, ensure seamless connectivity, and address other related issues. In this post, we bring you an overview of NetOps Support Frameworks, their key components, and […]