AI Model Performance metrics

less than 1 minute read

Published: January 17, 2025

- Perplexity

this measure reliably evaluates the model’s text prediction accuracy.
A language model with a perplexity of 20 is choosing from 20 equally likely options for each word in a sequence, per www.comet.com.
Lower perplexity signifies greater confidence and better predictions.

- Temperature

controls the randomness of output.
A common issue is hallucination, where a model generates false information.
Lowering the temperature can reduce, but not eliminate, hallucinations.

- Latency

evaluates a model’s response time, essential for interactive experiences.

  End-to-End Latency (E2E) = Network Latency + Model Processing Time + Output Generation Time + Post-processing Time.

- Token Throughput

the model’s processing speed and its token handling per second.

```For example, if a model generates 1,000 tokens in a batch of 50 within 10 seconds, the throughput is (1,000 * 50) / 10 = 5,000 tokens per second.```

*** Note: these metrics are important for model monitoring and improvement.

Share on

X (formerly Twitter) Facebook LinkedIn

Configuring Wifi in ESP32 WORM using code

1 minute read

Published: June 15, 2024

Recently, I have been delving into a specific use case that involves consuming a voice REST endpoint using the ESP32 microcontroller. This task requires not only utilizing the capabilities of the ESP32 but also ensuring that the device is connected to a Wi-Fi network for seamless communication with the endpoint.

Data mocking using Faker

2 minute read

Published: May 31, 2024

Ideally, test data is of priority and the project teams always face an issue in getting the relevant and realistic test data for pre-production activities. More issues(refresh of data; data manipulations etc.,) arise, when programs consume data from a shared environment. Sometimes, requirements of data varies and a new set of data should be replicated through external tools and technologies. Many commercial data mocking/stubbing tools are available in the market, but as a open source lover, I recommend using Faker library.