Sriram Selvam

I'm a Senior Software Engineer at Microsoft AI, where I build generative AI solutions with a focus on generative search and large language model (LLM) applications deployed on distributed systems. Outside of work, I explore ethical challenges in LLMs and conduct research aimed at addressing critical gaps in current AI research systems, with a particular focus on long-term privacy and responsible model behavior.

Large Language Models, Distributed Systems, Generative Search, System Design, Independent Researcher, 14 years of experience

Writing

Guide

Running Language Models Locally

Feb 18, 2025

Language models are transforming the way we interact with AI, with services like ChatGPT, Copilot, Gemini, and Perplexity offering seamless conversational ..

Research Read

Does ChatGPT's memory contain PII?

Feb 2, 2025

Yes, it does. Though it doesn't divulge that information easily, it may be possible to extract PII with specialized attacks. In the last post What makes LLM memorize things?

Research Read

What makes LLM memorize things?

Jan 26, 2025

As we enter 2025, LLMs have become ubiquitous, and it's now widely understood that they can retain information from their training data. We've all witnessed this in ..

Selected Projects

Copilot Search

Active

Copilot Search seamlessly blends the best of traditional and generative search together to help you find what you need.

I work on identifying and integrating enrichments—such as answer cards, videos, and images—with LLM-generated text responses. Along with my team, I build and maintain the online enrichment engine that serves millions of users every day.

Generative Search

Active

This experience combines the foundation of Bing's search results with the power of large and small language models (LLMs and SLMs). It understands the search query, reviews millions of sources of information, dynamically matches content, and generates search results in a new AI-generated layout to fulfill the intent of the user's query more effectively.

I'm one of the core founding team members behind Bing's Generative Search result generation framework (patent pending). Together with my team, I designed and built the initial framework for enrichment data retrieval, LLM/RAG-based enrichment pairing techniques, and LLM-driven evaluation and metrics. This pipeline has powered the generation of millions of enriched result pages, significantly enhancing user experience at scale.

PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs

Active

I co-developed PANORAMA, a large-scale synthetic dataset of 384K samples from 9.6K realistic human profiles to model the distribution and context of PII in online content. It was developed to help researchers in this space reliably evaluate their mitigation strategies and quantify how effectively various modeling techniques avoid memorizing sensitive private information.

Using OpenAI's o3-mini, we generated diverse web-native formats—social posts, reviews, wikis, and more—with embedded sensitive data. By fine-tuning Mistral-7B across varying data replication rates, we analyzed memorization patterns across content types, providing insights into privacy risks in LLM training. PANORAMA enables robust model auditing and privacy-preserving research, with open-source data and tooling.

Creator and Code Assist

Past

This feature helps users get tailored LLM-generated responses for their tasks with minimal prompting. It focuses on areas such as email writing, story creation, and code queries and creation.

I developed the prompts and the prompt templating engine that powers this experience. In addition, I was responsible for assembling the UX customization panel, enabling flexible and user-friendly interactions.

Industry Experience

Microsoft Corporation

Jun 2020 – Present

Senior Software Engineer

Worked across multiple teams in Bing including Copilot Search, Generative Search, Creator, Personas, and Knowledge Graph enrichment using LLMs.

  • Worked with teams across Bing to onboard enrichment experiences into Copilot Search
  • Designed and built the framework for collecting candidates and selecting the enrichments to go along with generated text responses.
  • Designed and built the offline GPT data generation engine powering creation of millions of result pages for Bing Generative Search
  • Filed a patent with the USPTO for the work on Generative Search
  • Developed dynamic prompt system and customization panel for Bing Creator & Personas, expanding support to coding and productivity segments
  • Increased engaged DAU by 15x through new segment integration and response control mechanisms
  • Built service APIs to customize LLM response traits (style, content, creativity)
  • Enhanced Bing's Knowledge Graph (Satori) using GPT-4 to extract contextual details while respecting ranking logic
  • Powered entity refinement tabs (e.g., Cast, Watch, Plot) and proposed and developed experiences like Black Friday shopping deals
  • Maintained high-quality data generation pipeline for millions of dynamic entities

IP Infusion

Nov 2013 – Aug 2018

Technical Lead Engineer

Led and contributed to protocol design, data center networking, and kernel-level enhancements for major clients including Ciena, Fujitsu, and Apple.

  • Led team in implementing Path Computation Element Protocol (PCEP) per RFC 8281/8231 for Ciena's SDN MPLS tunnels
  • Developed BGP Labelled Unicast by unifying IP/MPLS routes; received Best Individual Contributor award
  • Implemented IP Fast Reroute in IS-IS protocol achieving zero traffic loss; recognized by IP Infusion CEO
  • Designed RSTP for MC-LAG to eliminate broadcast loops in data center networks
  • Enhanced Linux Kernel 3.16.7 to simulate L2 control protocols on VMs for protocol testing
  • Delivered full VRF support for SSH, TELNET, SYSLOG, etc., in Debian using SNAT/DNAT rules in iptables; helped win Apple evaluation
  • Reduced TRILL convergence time from 500ms to 5ms for Fujitsu's Japan deployment

Aricent Group

Oct 2011 – Nov 2013

Software Engineer

Worked on switching solutions for EPON OLT used in Japan's broadband infrastructure.

  • Developed enhancements for Sumitomo Electric's FSU 7100 EPON OLT used by Time Warner Cable
  • Implemented NTP support using socket-based integration with OpenNTPD
  • Developed and tested features for the IS-IS routing protocol in Aricent's Intelligent Switching Solution product

Media Coverage

Selected media coverage highlighting features where I have been a key contributor.

Copilot Search will make you want to use Bing again. I gave the AI search tool a try at Microsoft's 50th Anniversary Event in Redmond, but it's available for most Bing users right now.

Microsoft has begun rolling out a new Copilot Search mode for its Bing search engine, integrating artificial intelligence to enhance search results with more personalised and context-aware responses.

Business Standard

Microsoft has launched its answer to Google's AI-powered search experiences: Bing generative search.

TechCrunch

As first reported by Windows Latest, the new AI-related feature in Bing's Copilot will roll out across many platforms. Microsoft's silent release indicates that it is gearing up to directly challenge Google's AI Search.

Tech Times

Microsoft's Bing search engine is leaning further in to artificial intelligence, with a new test feature that summarizes search results for users.

Education

University of Utah

M.S. Computer Science

Thesis: Switch Assisted Peer To Peer

A novel peer-to-peer transfer protocol that uses programmable switches to reduce network trunk traffic in data centers in multicast scenarios.

Key Courses

Machine Learning, Natural Language Processing, Advanced Algorithms, Data Mining

Anna University

B.E. Computer Science

Project: Implementation of Real-Time Face Detection using the Viola-Jones object detection framework

I led my group in implementing a Viola-Jones face detection framework in MATLAB, with a focus on reducing latency so that detection could run on real-time video frames.

Key Courses

Data Structures, Computer Networks, Operating Systems, Database Systems, Software Engineering

Resume

Explore my professional journey and discover how I can bring value to your next project.
