Who is Shubhendu Vaid?

Shubhendu Vaid is a Senior Software Engineering Manager and Engineering Leader at BT Group in London, UK, with 14+ years of experience building and scaling software platforms. He specializes in Next.js, React, Node.js, TypeScript, cloud architecture, and engineering leadership.

What is Shubhendu Vaid's current role and responsibilities?

Shubhendu leads the BT Business Broadband SMB Learn–Buy–Get acquisition and renewals journey. He defines technical strategy, architects scalable front‑end and cloud solutions, and grows high‑performing teams across the UK and India.

What technologies does Shubhendu Vaid specialize in?

Shubhendu specializes in Next.js, React, Node.js, TypeScript, Electron, AWS, GCP, MongoDB, event‑driven architecture, microservices, Java, Spring Boot, Angular, Playwright, and Generative AI. He also has deep experience with CI/CD, TDD, observability, and agile delivery.

What are Shubhendu Vaid's key achievements at BT Group?

At BT Group, Shubhendu rebuilt B2B sales and renewals experiences, improving conversion and reliability, and helped drive a 5x improvement in page performance. He architected a CMS adapter framework, scaled teams from single‑squad delivery to multi‑squad programs, and delivered LLM‑powered customer‑help and recommendation features.

Can I book Shubhendu Vaid for paid advisory or consulting?

Yes. Shubhendu offers paid 1:1 advisory on engineering leadership, system/solution architecture reviews, performance and platform strategy, and Generative AI adoption. Use the contact form or email to propose your topic and preferred time.

Where is Shubhendu Vaid located and available for work?

Shubhendu is based in London, UK, and is open to senior engineering leadership roles and consulting engagements across the UK and Europe, including remote and hybrid setups.

What is Shubhendu Vaid's educational background?

Shubhendu holds a Bachelor of Technology in Computer Science from Rajasthan Technical University, Kota.

How can I contact Shubhendu Vaid for opportunities?

You can reach Shubhendu via email at vaidshubhendu@gmail.com, connect on LinkedIn at linkedin.com/in/shubhendu-vaid, or view his work on GitHub at github.com/ShubhenduVaid. He’s happy to discuss roles, consulting, or leadership coaching.

Efficiently Retrieving Data from 1 Billion Records

Ever tried to fetch one user out of a billion and watched your latency graph do the cha-cha? At this scale, hardware helps, but data modeling decides whether reads stay boring. This walkthrough uses Cassandra, Docker, and Spring Boot to keep read paths predictable.

TL;DR

Fast reads at billion-scale are a data-modeling problem, not just a hardware problem.
Partition by country, use a compound primary key, and query by partition to keep latency stable.
Stack: Apache Cassandra + Docker + Spring Boot.
A Postman lookup returned 27 ms for a single record out of a billion (local, network excluded).

Why this problem matters

Billion-row datasets punish naive access patterns. If your query fans out across partitions, you get tail latency spikes and unpredictable performance. The goal is to make reads boring and repeatable.

This solution uses:

Apache Cassandra for distributed, high-throughput storage
Docker for repeatable local infrastructure
Spring Boot for clean API integration

Pro-tip: design your queries first, then model the table to serve those queries.

Prerequisites

Docker
Spring Boot
Postman

Step 1: Create a Cassandra cluster on Docker

Start by creating an isolated network for Cassandra:

docker network create cassandra-network

Run a seed node:

docker run -d --name cassandra-seed \
  --network cassandra-network \
  -e CASSANDRA_CLUSTER_NAME="MyCluster" \
  -e CASSANDRA_SEEDS="cassandra-seed" \
  -e CASSANDRA_LISTEN_ADDRESS="cassandra-seed" \
  -e CASSANDRA_BROADCAST_ADDRESS="cassandra-seed" \
  -p 9042:9042 \
  cassandra:latest

Add node 1 and node 2:

docker run -d --name cassandra-node1 \
  --network cassandra-network \
  -e CASSANDRA_CLUSTER_NAME="MyCluster" \
  -e CASSANDRA_SEEDS="cassandra-seed" \
  -e CASSANDRA_LISTEN_ADDRESS="cassandra-node1" \
  -e CASSANDRA_BROADCAST_ADDRESS="cassandra-node1" \
  cassandra:latest

docker run -d --name cassandra-node2 \
  --network cassandra-network \
  -e CASSANDRA_CLUSTER_NAME="MyCluster" \
  -e CASSANDRA_SEEDS="cassandra-seed" \
  -e CASSANDRA_LISTEN_ADDRESS="cassandra-node2" \
  -e CASSANDRA_BROADCAST_ADDRESS="cassandra-node2" \
  cassandra:latest

Connect via cqlsh:

cqlsh

Step 2: Create keyspace and schema

Replication factor of 3:

CREATE KEYSPACE billion
WITH replication = {
  'class': 'SimpleStrategy',
  'replication_factor': 3
};

Create the users table with a partition key of country:

CREATE TABLE billion.users (
  id UUID,
  name TEXT,
  email TEXT,
  dob DATE,
  address TEXT,
  phone TEXT,
  country TEXT,
  date_created TIMESTAMP,
  PRIMARY KEY ((country), id)
);

| Geek Corner | |:--| | The partition key is the routing slip. Pick it well and reads stay local and fast. Pick it poorly and you spray queries across nodes. |

Step 3: Spring Boot API for reads/writes

Create a User service with three endpoints:

@GetMapping("/{country}/{id}")
public ResponseEntity<User> getUserByCountryAndID(@PathVariable String country, @PathVariable UUID id) {
  return userService.getUserByCountryAndId(country, id)
      .map(ResponseEntity::ok)
      .orElse(ResponseEntity.notFound().build());
}

@GetMapping("/{country}")
public ResponseEntity<List<User>> getUsersByCountry(@PathVariable String country) {
  return userService.getUserByCountry(country)
      .map(ResponseEntity::ok)
      .orElse(ResponseEntity.notFound().build());
}

@PostMapping
public ResponseEntity<User> createUser(@RequestBody User userRequest) {
  User user = new User();
  user.setId(UUID.randomUUID());
  user.setName(userRequest.getName());
  user.setEmail(userRequest.getEmail());
  user.setAddress(userRequest.getAddress());
  user.setDob(LocalDate.now());
  user.setPhone(userRequest.getPhone());
  user.setCountry(userRequest.getCountry());
  user.setDate_created(LocalDateTime.now());
  user = userService.saveUser(user);
  return ResponseEntity.ok(user);
}

Create a second Spring Boot service (UserCreater) to generate large volumes:

@GetMapping("/generate-users")
public String generateUsers() {
  userService.generateAndPostUsers(1_000_000_000);
  return "User generation started!";
}

Tip: DbVisualizer is a simple free UI for Cassandra inspection.

Reality check: generating a billion rows is heavy on time and storage. Start smaller to validate schema and read paths, then scale.

Step 4: Validate performance

Using Postman, querying a specific user by country + ID returned:

27 ms for a single record out of a billion.

This excludes network latency, but confirms the partition strategy keeps reads fast.

Cleanup

After testing:

DROP TABLE billion.users;

Source code

User Creator: https://github.com/ShubhenduVaid/javaUserCreater
User Service: https://github.com/ShubhenduVaid/javaUser