Open to consulting & collaboration

Nirbhay
Singh

> |

I architect secure multi-cloud platforms, build AI/ML deployment pipelines, and ship open-source tools that save engineering teams real money.

11+ Years Experience
3 Cloud Platforms
14+ Certifications
4M+ Saved via FinOps
Nirbhay Singh
nirbhay@cloudtoai:~
Scroll

Building the infrastructure
that powers AI at scale.

Cloud & AI Architect with 11+ years designing production systems across AWS, GCP, and Azure. I bring together automation, observability, and FinOps to deliver platforms that are secure, scalable, and cost-efficient.

Currently at Bosch Polska, I lead multi-cloud strategy and AI platform delivery — integrating Prompt Engineering, LLMs, and Agentic AI into enterprise search and automation systems using Vertex AI, BigQuery, and Cloud Run.

I'm also an active open-source contributor, building tools like ShieldIaC, TokenMeter, and InfraCents — solving real problems in cloud security, AI cost management, and infrastructure visibility.

Location Warsaw, Poland
Current Role Cloud & AI Architect @ Bosch
Focus Multi-Cloud, MLOps, GenAI, FinOps
Education MTech — Birla Institute of Technology
Languages English, Hindi

Skills & Technologies

Cloud Platforms

AWS GCP Azure VMware Multi-Cloud

Infrastructure as Code

Terraform CloudFormation Ansible Puppet HCL

Containers & Orchestration

Kubernetes Docker EKS GKE AKS Anthos

CI/CD & DevOps

GitHub Actions Jenkins GitOps ArgoCD CI/CD

AI / ML / GenAI

Vertex AI MLOps LLMOps RAG Prompt Engineering Agentic AI SageMaker

Security & Observability

IAM Zero-Trust Prometheus Grafana ELK FinOps

Languages & Scripting

Python Bash YAML JSON HCL PowerShell

Data & Analytics

BigQuery Redshift EMR Cloud Run Enterprise Search

Work Experience

Dec 2022 — Present

Cloud & AI Architect

Bosch Polska · Warsaw, Poland
  • Integrated Prompt Engineering, LLMs, and Agentic AI into enterprise search and automation, deploying workloads with Vertex AI, BigQuery, and Cloud Run
  • Designed LLM and enterprise search solutions adopted across multiple business units in Bosch Europe
  • Led strategic partnership between Google, AWS, and Bosch in Poland
  • Implemented FinOps at Bosch Europe, delivering €4M+ in savings through optimization and governance
  • Mentoring senior developers and architects on multi-cloud strategy across global teams
GCP AWS Vertex AI LLMs FinOps BigQuery
Jun 2021 — Nov 2022

Senior DevOps Consultant

McAfee
  • Automated infrastructure with Terraform (IaC), achieving 50%+ reduction in deployment time
  • Deployed and managed container platforms across GKE, EKS, AKS, and ECS
  • Built monitoring, logging, and alerting systems to proactively resolve incidents
  • Developed automation scripts in Python, Bash, YAML, and JSON for cloud infrastructure
Terraform AWS GCP Azure Kubernetes
Dec 2018 — Jun 2021

Senior DevOps Engineer / Data & AI Architect

Deloitte
  • Reduced AWS costs by $45k and accelerated CI/CD pipeline speed by 30×
  • Orchestrated 1,000+ containers using AWS Batch, Spot Instances, and EKS
  • Built data & AI platforms on EMR, Redshift, and SageMaker
  • Implemented multi-cloud security spanning GCP, AWS, Azure, and OCI
AWS EKS SageMaker Redshift CI/CD
Nov 2015 — Dec 2018

Senior Cloud Infrastructure Engineer

PTC
  • Designed secure, fault-tolerant AWS architectures (EC2, RDS, S3, VPC)
  • Built disaster recovery strategies for cloud and on-premise environments
  • Managed 11,000+ server Windows infrastructure with Active Directory
  • Automated operations with PowerShell and led large-scale migrations
AWS Active Directory PowerShell DR
Mar 2013 — Jul 2014

System Engineer

IBM
  • Infrastructure operations, migrations, and incident response (P1/P2)
  • Root cause analysis and resolution of critical production issues
Infrastructure Migrations Incident Response

Featured Projects

ShieldIaC

AI-Powered IaC Security Scanner

Catch security misconfigurations in Infrastructure-as-Code before they reach production. 100+ rules across 9 compliance frameworks (CIS, SOC 2, HIPAA, PCI-DSS, NIST, ISO 27001) with AI-powered fix suggestions.

100+ Security Rules 9 Compliance Frameworks AI Fix Suggestions
Python Terraform CloudFormation GPT-4.1 GitHub Actions

TokenMeter

Cost Intelligence Layer for LLM Apps

Track every token. Optimize every dollar. One-line integration that replaces a single import — automatically tracking cost, latency, and tokens across OpenAI, Anthropic, and Google. Smart routing saves up to 60% by matching task complexity to model capability.

<5ms Overhead 60% Cost Savings 3 Providers
Python OpenAI Anthropic Smart Routing PyPI

InfraCents

Terraform Cost Estimates on Every PR

An open-source GitHub App that posts real-time cloud cost estimates directly on pull requests. Parses Terraform changes, queries live pricing APIs from AWS and GCP, and tells your team exactly how much a PR will cost before it merges. Zero config required.

AWS+GCP Pricing Zero Config PR Integration
Python Next.js Terraform GitHub App AWS API
More Open Source

AgentLoom

Observable Agent Workflows with Python Decorators

Build, trace, and debug multi-step agent workflows with zero boilerplate. Decorator-based step definitions with automatic timing, input/output capture, async support, and conditional branching with configurable retries.

21+ Tests Async Support Zero Boilerplate
Python Pydantic AsyncIO Click Structlog

Airlock

Drop-in LLM Security Proxy

One-line security layer for LLM APIs. Automatically redacts PII, detects prompt injection attempts, enforces rate limits, and tracks cost budgets — 100% local, no external services or data leaving your infrastructure.

6 Injection Detectors 31+ Tests 100% Local
Python FastAPI Pydantic Uvicorn Structlog

DataMint

Synthetic Eval Datasets for LLM Testing

Generate Q&A pairs, adversarial red-team prompts, and structured tabular data for LLM evaluation pipelines. No API keys, no LLM required. Fully deterministic with seed control — 1,000+ items per second.

1K+/sec No API Needed 37+ Tests
Python Faker Jinja2 Click Pydantic

ModelLedger

MLOps Compliance & Lineage Tracking

Track model versions, dataset provenance, and experiment history. Generate EU AI Act and NIST AI RMF compliance reports in one command. Fully offline — no database, no cloud setup required.

EU AI Act NIST RMF 48+ Tests
Python Jinja2 Pydantic Click Structlog

TuneForge

Fine-tune & Serve Open-Source LLMs in 3 Commands

LoRA fine-tuning with 4-bit quantization, built-in evaluation metrics, and FastAPI model serving — all in one CLI. GPU optional: validate your config locally and train on cloud when ready.

LoRA Fine-tuning 4-bit Quantization 38+ Tests
Python Transformers PEFT FastAPI HuggingFace

Certifications

Google Cloud

Professional Machine Learning Engineer Valid through Dec 2027
Professional Cloud DevOps Engineer Valid through Dec 2027
Professional Cloud Architect Valid through Jun 2026
Generative AI Leader Valid through Aug 2028
Associate Cloud Engineer Earned Aug 2023

Amazon Web Services

Solutions Architect — Professional Earned Feb 2020
Solutions Architect — Associate Earned Jan 2021

Other

CKAD — Certified Kubernetes Application Developer The Linux Foundation · Valid through Sep 2026
Terraform Associate HashiCorp
MCSA — Microsoft Certified Solutions Associate Microsoft · Earned Apr 2015

Education

Master of Technology (M.Tech)

Birla Institute of Technology and Science CGPA: 8.3

Master of Computer Applications (MCA)

SRM University

Bachelor of Science in IT (BSc IT)

Kuvempu University

Let's Build Something Together

Whether it's cloud architecture consulting, AI/ML platform strategy, or open-source collaboration — I'm always open to interesting conversations and new challenges.