HomeCloud ComputingCisco Safe AI Manufacturing unit attracts on Splunk Observability

Cisco Safe AI Manufacturing unit attracts on Splunk Observability


Synthetic intelligence is reshaping each trade, and unlocking its full potential requires infrastructure that’s strong, scalable, safe, and observable. As organizations broaden their AI initiatives, managing complicated workloads and guaranteeing constant efficiency change into mission-critical.

That is the place Cisco AI PODs, the foundational constructing blocks of Cisco Safe AI Manufacturing unit with NVIDIA, mixed with the deep visibility of Splunk Observability Cloud, ship a robust resolution for constructing and operating fashionable AI environments.

Cisco AI PODs: The inspiration for AI innovation

Cisco AI PODs are modular, versatile, and scalable AI infrastructure designed to speed up time to worth for AI tasks. They permit organizations to deploy production-grade AI environments rapidly—however to maintain these environments operating optimally, groups want complete perception into efficiency and well being.

How will you detect points early, troubleshoot effectively, and concentrate on delivering enterprise outcomes as a substitute of spending time addressing pressing manufacturing points? That’s the place observability turns into indispensable.

Splunk Observability: Your eyes and ears inside AI PODs

Slide for Cisco Secure AI Factory with NVDIA - displaying a vertically integrated deployment option built on Cisco AI PODs and including NVIDIA AI software, Kubernetes platform, Cisco AI networking, Cisco Compute with NVIDIA Accelerated Computing and partner storage. Additional pillars include Cisco Security and Splunk Observability.Slide for Cisco Secure AI Factory with NVDIA - displaying a vertically integrated deployment option built on Cisco AI PODs and including NVIDIA AI software, Kubernetes platform, Cisco AI networking, Cisco Compute with NVIDIA Accelerated Computing and partner storage. Additional pillars include Cisco Security and Splunk Observability.

Splunk Observability Cloud delivers end-to-end visibility throughout each layer of Cisco AI PODs—from bodily infrastructure to Kubernetes to the AI functions layer.

It’s not nearly information assortment. Splunk turns metrics, traces, and logs into actionable insights, serving to groups detect, troubleshoot, and resolve points in seconds.

We’re excited to introduce a brand new Splunk Dashboard purpose-built for observability throughout all the AI POD stack.

Screen display of Cisco AI PODs dashboard's AI POD overview.Screen display of Cisco AI PODs dashboard's AI POD overview.

What the brand new Splunk Dashboard brings to Cisco AI PODs

  • Unified Kubernetes cluster monitoring – Get a single view of all Kubernetes clusters, together with Pink Hat OpenShift operating on AI PODs.
  • Deep host-level insights – Monitor the efficiency of particular person Cisco UCS servers, together with CPU, reminiscence, disk, and community utilization.
  • AI POD infrastructure dashboard – Observe vital metrics like GPU utilization, GPU reminiscence utilization, energy, and community efficiency, integrating information from Cisco Intersight and Cisco Nexus.
  • Streaming analytics benefit – Leverage Splunk’s real-time streaming analytics to realize quicker detection and near-instant “time to glass.”

Whereas Cisco AI PODs present modular, scalable infrastructure for enterprise AI, every AI POD may also be monitored individually. This enables groups to achieve detailed perception into the distinctive efficiency metrics and workloads of a particular deployment. Listed below are some screens from the Splunk Dashboard for AI PODs to assist visualize the monitoring capabilities. By aggregating the variety of enter and output tokens processed by the massive language mannequin (LLM) operating on an AI POD, Splunk is ready to calculate an approximate value for token utilization over time:

Screen display of Cisco AI PODs dashboard's AI POD on the Tokenomics tab.Screen display of Cisco AI PODs dashboard's AI POD on the Tokenomics tab.

Splunk additionally pulls in metrics from Cisco Intersight, to supply visibility to lively alarms associated to the monitored AI POD, and key UCS metrics comparable to UCS host energy, temperature, and fan velocity:

Screen display of Cisco AI PODs dashboard on the Intersight tab.Screen display of Cisco AI PODs dashboard on the Intersight tab.

The Nexus dashboard supplies perception into the interfaces configured on every Nexus swap, the transmit errors and drops, and the information transferred between storage and compute nodes:

Screen display of Cisco AI PODs dashboard on the Nexus Switches tabScreen display of Cisco AI PODs dashboard on the Nexus Switches tab

An actual-world situation: Diagnosing LLM latency

Think about an utility operating on a Cisco AI POD using an LLM for person queries. All of the sudden, response instances on the appliance spike. Right here’s how Splunk Observability Cloud helps resolve it in minutes:

  1. Alert triggered – Splunk detects excessive response instances and raises an alert.
  2. Hint evaluation – The service map highlights that the majority latency happens inside /v1/chat/completions calls to the LLM.
  3. Infrastructure view – The AI POD dashboard reveals that solely one of many 4 out there GPUs is lively and totally utilized.
  4. Actionable perception – You reconfigure the workload to make use of all GPUs—immediately restoring efficiency.

The NVIDIA connection: Powering clever workloads

Splunk Observability additionally screens key NVIDIA AI Enterprise elements—together with the NVIDIA NIM operator and NVIDIA NIMs microservices for LLM inferencing—guaranteeing the NVIDIA software program stack performs at its greatest.

FedRAMP and authorities readiness: Splunk’s present path in direction of attaining FedRAMP Reasonable for Splunk Observability

Splunk stays a trusted accomplice in authorities digital transformation, empowering businesses to ship safe, resilient, and clever companies by means of cloud and customer-managed options. Constructing on the success of Splunk Cloud Platform—approved at FedRAMP Excessive and DoD Influence Stage 5, and listed on the StateRAMP (dba GovRAMP) Approved Merchandise Listing—Splunk continues to spend money on increasing our FedRAMP program to fulfill evolving public sector wants. As beforehand introduced, Splunk Observability Cloud has already obtained “In Course of” designation and awaits full authorization to function on the Reasonable degree from the FedRAMP Program Administration Workplace. Splunk stays dedicated to supporting the safety and mission success of all our authorities prospects.

Observability: A cornerstone of Cisco Safe AI Manufacturing unit with NVIDIA

In Cisco Safe AI Manufacturing unit with NVIDIA, observability isn’t optionally available—it’s foundational.

By delivering deep, real-time insights throughout infrastructure and functions, Splunk Observability Cloud enhances:

  • Operational effectivity
  • Useful resource optimization
  • Reliability and uptime
  • Safety posture

This holistic visibility is crucial for constructing, working, and securing complicated AI pipelines at scale.

Conclusion

Cisco AI PODs ship the strong, scalable infrastructure required for at this time’s demanding AI workloads. When paired with Splunk Observability Cloud, organizations achieve unmatched visibility and management—enabling fast troubleshooting, optimum efficiency, and quicker innovation.

Splunk Observability kinds a core pillar of Cisco Safe AI Manufacturing unit with NVIDIA, empowering companies to construct and run AI with confidence, velocity, and safety.

 

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments