DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library
Core Badge
Avatar

Jubin Abhishek Soni

DZone Core CORE

Senior Software Engineer at Yahoo

US

Joined May 2025

About

Jubin Soni is a Senior Software Engineer with 14+ years of experience building scalable systems, real-time data pipelines, and AI-driven platforms for industry leaders in technology and media. With deep expertise spanning cloud-native architectures, distributed systems, and applied machine learning, Jubin brings a rare combination of engineering depth and research breadth to every problem he tackles. He is a published researcher with work appearing in IEEE and other peer-reviewed venues, and a Manning Publications author. Jubin holds IEEE Senior Member status and has spoken at technical conferences including P99 CONF, ACM and APIdays, sharing his expertise in distributed systems, serverless architectures, and AI with engineering communities globally. He is passionate about pushing the boundaries of what scalable software can do — and sharing those insights with fellow engineers through writing, research, and open source.

Stats

Reputation: 2017
Pageviews: 87.6K
Articles: 38
Comments: 3

Expertise

Microservices Expertise Icon

Microservices

  • Articles
  • Comments

Articles

article thumbnail
Amazon OpenSearch Vector Search Explained for RAG Systems
Use Amazon OpenSearch k-NN as your RAG vector store. Build a small Python example: create the index, embed docs, search by meaning.
June 9, 2026
· 690 Views
article thumbnail
Amazon Quick: AWS's Agentic Workspace, Explained for Engineers
A technical deep dive into Amazon Quick — how it works, how it connects to your tools via MCP, and where it sits in the AWS agent stack.
June 9, 2026
· 787 Views
article thumbnail
S3 Vectors: How to Build a RAG Without a Vector Database
You don't need a vector database anymore. S3 Vectors gives you native vector storage + similarity search, serverless, at up to 90% less cost
May 19, 2026
· 1,333 Views
article thumbnail
Run Gemma 4 on Your Laptop: A Hands-On Guide to Google's Latest Open Multimodal LLM
Gemma 4 — architecture deep-dive, thinking mode, function calling, audio + image input, and a small project — no cloud, no API keys.
May 19, 2026
· 3,340 Views · 2 Likes
article thumbnail
The Agent Protocol Stack: MCP vs. A2A vs. AG-UI
Three protocols are shaping how AI agents interact with tools, other agents, and users. Here's what each one does, how they fit together, and when to reach for which.
May 15, 2026
· 2,601 Views
article thumbnail
AWS Kiro: The Agentic IDE That Makes Specs the Unit of Work
You describe a feature, and Kiro generates structured requirements.md, design.md, and tasks.md files first, then implements against them.
May 13, 2026
· 2,882 Views · 2 Likes
article thumbnail
I Gave Gemini 3 My Worst Legacy Code — Here’s What Happened
Feeding legacy code to Gemini 3 reveals key refactoring lessons: prioritize modularity, decouple logic, and use AI to automate unit tests.
May 7, 2026
· 2,669 Views · 1 Like
article thumbnail
Engineering LLMOps: Building Robust CI/CD Pipelines for LLM Applications on Google Cloud
Master LLMOps on GCP by automating prompt evaluation, model deployment, and monitoring with Cloud Build and Vertex AI for robust AI apps.
May 5, 2026
· 1,858 Views
article thumbnail
5 Ways Azure AI Search Enhances Enterprise RAG Architectures
Azure AI Search enhances RAG through hybrid search, semantic reranking, and enterprise-grade security for scalable production AI apps.
April 30, 2026
· 3,266 Views
article thumbnail
What AWS Kiro Matters for Agentic Development
AWS Kiro is a high-speed communication fabric for AI agents, enabling sub-millisecond coordination and persistent memory management.
April 29, 2026
· 2,179 Views
article thumbnail
65% of Enterprises Will Deploy Agentic AI by 2027: A Deep Technical Analysis of Readiness
Agentic AI is the next frontier for enterprises. This guide covers technical architectures, multi-agent design, and deployment readiness.
April 28, 2026
· 2,868 Views
article thumbnail
Architecting Autonomous Agents: A Deep Dive into Azure AI Foundry Agent Service
Build enterprise-ready AI agents using Azure AI Foundry Agent Service with integrated tools, state management, and robust security.
April 27, 2026
· 1,977 Views
article thumbnail
Gemini + Veo: A Deep Dive into Google’s High-Fidelity Video Generation Pipeline
A deep dive into Google's video generation stack, combining Gemini's reasoning with Veo's cinematic 1080p video production pipeline.
April 23, 2026
· 2,293 Views
article thumbnail
Gemini Agent vs Microsoft Copilot vs ChatGPT Operator: How They Compare
AI is shifting from chat to action. Learn how Agentic AI, Copilots, and Operators differ in architecture, capability, and autonomy.
April 20, 2026
· 1,796 Views
article thumbnail
AI-Powered Dev Workflows: How SWEs Are Shipping Faster in 2026
Boost your velocity with AI-orchestrated workflows. Learn best practices for prompt engineering, automated reviews, and secure code generation.
April 17, 2026
· 2,821 Views · 1 Like
article thumbnail
Architecting the Future of Research: A Technical Deep-Dive into NotebookLM and Gemini Integration
Explore how NotebookLM and Gemini 1.5 Pro revolutionize research through source grounding, long context windows, and content pipelines.
April 15, 2026
· 2,651 Views
article thumbnail
Mastering Gemma 4
Master Gemma 4 with this deep dive into its architecture, distillation training, and Python implementation for production AI workflows.
April 15, 2026
· 3,170 Views
article thumbnail
MCP + AWS AgentCore: Give Your AI Agent Real Tools in 60 Minutes
A hands-on walkthrough on building an AI agent with real tools using AWS Bedrock AgentCore Runtime. FastMCP and the Strands agent.
April 8, 2026
· 3,641 Views
article thumbnail
Beyond the LLM: Why Amazon Bedrock Agents Are the New EC2 for AI Orchestration
Amazon Bedrock Agents are the EC2 of the agentic era, providing scalable, managed infrastructure for complex AI reasoning workflows.
April 7, 2026
· 2,635 Views
article thumbnail
Mastering Azure Kubernetes Service: The Ultimate Guide to Scaling, Security, and Cost Optimization
Learn to optimize AKS with automated scaling, robust security policies, and cost-saving techniques for high-performance cloud clusters.
April 2, 2026
· 2,905 Views · 1 Like
article thumbnail
Getting Started with Gemini Agents: Build a Data-Connected RAG Agent using Vertex AI Agent Builder
Build a production-ready RAG agent using Gemini and Vertex AI Agent Builder to query your private data with high accuracy and speed.
March 31, 2026
· 1,201 Views
article thumbnail
Stateful AI: Streaming Long-Term Agent Memory With Amazon Kinesis
Stream every agent interaction into vector memory for real-time, scalable, persistent, fully queryable, and context-rich AI memory.
March 26, 2026
· 2,206 Views
article thumbnail
Mastering Serverless Architecture: Event-Driven Design with Azure Functions and Cosmos DB
A comprehensive guide to building serverless event-driven systems using Azure Functions and Cosmos DB, featuring real-world patterns.
March 25, 2026
· 1,618 Views
article thumbnail
Google Cloud AI Agents With Gemini 3: Building Multi-Agent Systems That Actually Work
Build and scale multi-agent systems using Gemini 3 on Google Cloud Vertex AI, featuring code and architecture for technical experts.
March 12, 2026
· 5,485 Views · 2 Likes
article thumbnail
Implementing Decentralized Data Architecture on Google BigQuery: From Data Mesh to AI Excellence
Traditional centralized data lakes don’t scale for AI. A Data Mesh not only decentralizes data ownership by domain but also enforces federated governance.
March 3, 2026
· 2,660 Views · 1 Like
article thumbnail
AWS Step Functions + AI: Smarter Orchestration in Modern Applications
Learn to combine AWS Step Functions with Generative AI for scalable orchestration, featuring architecture patterns and code examples
March 2, 2026
· 1,595 Views
article thumbnail
Mastering the AWS Well-Architected AI Stack: A Deep Dive into ML, GenAI, and Sustainability Lenses
Use AWS’s ML, GenAI, and Sustainability lenses together to build AI systems that are production-ready, governed, cost-efficient, and energy-efficient.
February 27, 2026
· 1,657 Views · 1 Like
article thumbnail
Unified Intelligence: Mastering the Azure Databricks and Azure Machine Learning Integration
Bridge the gap between Big Data and production ML. Learn to integrate Azure Databricks with Azure Machine Learning for a seamless, scalable end-to-end MLOps workflow.
February 27, 2026
· 1,234 Views
article thumbnail
AWS Bedrock vs. SageMaker: Choosing the Right GenAI Stack in 2026
Deciding between Bedrock's serverless ease and SageMaker's deep control? This guide breaks down the 2026 AWS GenAI landscape for you.
February 26, 2026
· 1,403 Views · 1 Like
article thumbnail
Azure SLM Showdown: Evaluating Phi-3, Llama 3, and Snowflake Arctic for Production
Evaluate Phi-3, Llama 3, and Snowflake Arctic. Learn to deploy cost-effective, high-performance SLMs on Azure for production workloads.
February 23, 2026
· 1,467 Views

Comments

AWS Kiro: The Agentic IDE That Makes Specs the Unit of Work

May 20, 2026 · Jubin Abhishek Soni

Really appreciate that, Georgi, thanks for taking the time to read it!

Enhancing SQL Server Performance with Query Store and Intelligent Query Processing

Apr 14, 2026 · arvind toorpu

Great article! thanks for sharing

Mastering Azure Kubernetes Service: The Ultimate Guide to Scaling, Security, and Cost Optimization

Apr 14, 2026 · Jubin Abhishek Soni

Thanks, appreciate it! Great question, AKS autoscaling behaves similarly to self-managed Kubernetes since it uses the same HPA + Cluster Autoscaler components. HPA reacts quickly to spikes, but node scaling still has some lag due to provisioning time. AKS mainly makes this more reliable and easier to manage, but for sharp bursts, you may still need buffer capacity or tools like KEDA.

User has been successfully modified

Failed to modify user

  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook