Running Language Models Locally
Feb 18, 2025
Language models are transforming the way we interact with AI, with services like ChatGPT, Copilot, Gemini, and Perplexity offering seamless conversational ..
I'm a Senior Software Engineer at Microsoft AI, where I build generative AI solutions with a focus on generative search and large language model (LLM) applications deployed on distributed systems. Outside of work, I explore ethical challenges in LLMs and conduct research aimed at addressing critical gaps in current AI research systems, with a particular focus on long-term privacy and responsible model behavior.
Feb 18, 2025
Language models are transforming the way we interact with AI, with services like ChatGPT, Copilot, Gemini, and Perplexity offering seamless conversational ..
Feb 2, 2025
Yes, it does. Though it doesn't divulge that information easily, it may be possible to extract PII with specialized attacks. In the last post What makes LLM memorize things?
Jan 26, 2025
As we enter 2025, LLMs have become ubiquitous, and it's now widely understood that they can retain information from their training data. We've all witnessed this in ..
Copilot Search seamlessly blends the best of traditional and generative search together to help you find what you need.
I work on identifying and integrating enrichments—such as answer cards, videos, and images—with LLM-generated text responses. Along with my team, I build and maintain the online enrichment engine that serves millions of users every day.
This experience combines the foundation of Bing's search results with the power of large and small language models (LLMs and SLMs). It understands the search query, reviews millions of sources of information, dynamically matches content, and generates search results in a new AI-generated layout to fulfill the intent of the user's query more effectively.
I'm one of the core founding team members behind Bing's Generative Search result generation framework (patent pending). Together with my team, I designed and built the initial framework for enrichment data retrieval, LLM/RAG-based enrichment pairing techniques, and LLM-driven evaluation and metrics. This pipeline has powered the generation of millions of enriched result pages, significantly enhancing user experience at scale.
I co-developed PANORAMA, a large-scale synthetic dataset of 384K samples from 9.6K realistic human profiles to model the distribution and context of PII in online content. This was developed specifically to help researchers working in the space to reliably evaluate their mitigation strategies and quantify various modeling techniques for their efficacy at not remembering sensitive private information.
Using OpenAI's o3-mini, we generated diverse web-native formats—social posts, reviews, wikis, and more—with embedded sensitive data. By fine-tuning Mistral-7B across varying data replication rates, we analyzed memorization patterns across content types, providing insights into privacy risks in LLM training. PANORAMA enables robust model auditing and privacy-preserving research, with open-source data and tooling.
This feature helps user get tailored LLM generated responses for their tasks with minimal prompting. Feature focuses on various areas such as email writing, story creation, code queries and creation.
I developed the prompts and the prompt templating engine that powers this experience. In addition, I was responsible for assembling the UX customization panel, enabling flexible and user-friendly interactions.
Senior Software Engineer
Worked across multiple teams in Bing including Copilot Search, Generative Search, Creator, Personas, and Knowledge Graph enrichment using LLMs.
Technical Lead Engineer
Led and contributed to protocol design, data center networking, and kernel-level enhancements for major clients including Ciena, Fujitsu, and Apple.
Software Engineer
Worked on switching solutions for EPON OLT used in Japan's broadband infrastructure.
Selected media coverage highlighting features where I have been a key contributor.
Copilot Search will make you want to use Bing again. I gave the AI search tool a try at Microsoft's 50th Anniversary Event in Redmond, but it's available for most Bing users right now.
PCMag Read Article
Microsoft has begun rolling out a new Copilot Search mode for its Bing search engine, integrating artificial intelligence to enhance search results with more personalised and context-aware responses.
Business Standard Read Article
Microsoft has launched its answer to Google's AI-powered search experiences: Bing generative search.
TechCrunch Read Article
As first reported by Windows Latest, the new AI-related feature in Bing's Copilot will roll out across many platforms. Microsoft's silent release indicates that it is gearing up to directly challenge Google's AI Search.
Tech Times Read Article
Microsoft's Bing search engine is leaning further in to artificial intelligence, with a new test feature that summarizes search results for users.
CNET Read Article
A novel Peer to Peer transfer protocol that utilizes programmable switches to reduce the network trunk traffic in data centers in multicast situations read more
I led my group in implementing a Viola-Jones face detection framework using MATLAB, with a focus on optimizing latency for seamless integration into real-time video frames.read more
Explore my professional journey and discover how I can bring value to your next project.
View Now