Infrastructure

Telemetry and Metrics with Run:ai

by
Alon Lavian
–
April 15, 2024

Doing more with Run:ai’s APIs

Run:ai 2.17 delivers major improvement to telemetry consumption with the release of a new API and metrics service. This new offering provides a more comprehensive and user-friendly way to access and analyze telemetry data, making it easier for teams to gain insights into the performance and behavior of your applications.

The new API provides a RESTful interface that allows users to programmatically access telemetry data, which can be integrated with other applications and tools for a more comprehensive view. The metrics service provides a collection of pre-built dashboards and reports that allow quick and easy visualization. The combination of the new API and metrics service simplifies the process of identifying data trends and patterns enabling teams to make informed decisions about their applications.

Value Proposition

In addition to the general benefits of our platform, we offer specific value propositions tailored to different roles and their unique needs;

Discovery - Data Analysts, Developers, and Power Users

  • Granular Data Access: We allow users to drill down to the most granular level of data, ensuring access to the specific information needed for analysis.
  • Custom Filters: Users are able to create customized filters to narrow down data sets and focus on the relevant subsets needed for the analysis.
  • Advanced Queries: There’s now complete access to utilize advanced SQL queries to extract complex insights from the data, empowering teams to conduct an in-depth analysis.
  • Task Automation: Users can easily automate repetitive tasks by leveraging scheduled API calls. This feature allows you to trigger specific actions based on predefined workload conditions, saving time and effort.

Insights - Product Managers, System Administrators, and Business Analysts:

  • Pre-Built Dashboards and Reports: Access a library of pre-built dashboards and reports that provide valuable insights. These reports are designed to meet common business needs, enabling fast visualization and analysis of key metrics.
  • Custom Dashboard and Report Builder: Create custom dashboards and reports tailored to specific requirements. The intuitive drag-and-drop interface makes it easy to build visually appealing and informative reports that convey key data effectively.

Action - Executives and Decision-Makers

  • Real-Time Alerts: Stay informed with real-time alerts indicating critical events or changes in the data enabling rapid response to emerging issues or opportunities.
  • Custom KPIs: Define custom key performance indicators (KPIs) that align to business goals. Real-time KPI tracking delivers a clear view of progress toward objectives.
  • BI Integration: Seamlessly integrate with preferred business intelligence (BI) tools enabling analysis and visualization delivering a unified view of your business performance.

‍

Use Cases and Examples

Build custom dashboards

With the new APIs teams can create customized dashboards, tailored to their needs. Use the different telemetry and metrics to create dashboards that will help you use Run:ai better. We’ve created the new Overview dashboard in the app using the new APIs to help you get started. Try the following endpoints

  • /cluster/metrics
  • /nodepools/metrics
  • /workloads/telemetry

‍

Check your resources' availability

Using the /nodes/telemetry endpoint you can monitor your free nodes availability and smartly plan the scheduling of large workloads.

‍

Monitor your workload

Dashboards give the bigger picture, but sometimes you want to monitor a specific workload that is of importance to you. Try the following endpoints;

  • /workloads/{workloadId}/metrics 
  • /workloads/{workloadId}/pods/{podId}/metrics

These new features will help you improve your productivity and make better decisions about your applications. We encourage you to try out the new API and metrics service and let us know what you think.

Curious about the entire release? Check out the announcement here. 

Ready to get started? Book your demo today and see how Run:ai can help you accelerate AI development and increase efficiency.