Core AI Models (LLMs & Generative AI)

OpenAI (GPT-4, GPT-4o, GPT-3.5)

Anthropic Claude (Claude 3 family)

xAI Grok (xAI's LLM, integrated with X/Twitter)

DeepSeek (Chinese LLM with efficiency focus)

Google Gemini

Meta Llama (Llama 2, Llama 3)

Mistral (Mixtral, small/efficient open LLMs)

Cohere (Command R for RAG, Embeddings API)

Ollama (run open-source LLMs locally on Mac/Linux/Windows)

Stability AI (Stable Diffusion) – image generation

Runway (AI video generation)

ElevenLabs (AI voice generation)

Frameworks & Orchestration
LangChain
chaining LLMs + tools + APIs
LlamaIndex (formerly GPT Index)
retrieval-augmented generation (RAG)
Flowise AI
no-code visual builder for LangChain
Haystack (by deepset.ai)
open-source RAG & search framework
LangGraph
graph-based LLM agent orchestration
AutoGPT / CrewAI
autonomous AI agents

Vector Databases
(for RAG, embeddings, semantic search)
Pinecone
managed, scalable, easy to integrate
Weaviate
open-source, modular vector DB
Milvus / Zilliz
high-performance, enterprise-level
Chroma
lightweight, often used in prototyping
Qdrant
open-source, strong filtering capabilities
FAISS (Facebook AI Similarity Search)
library for fast vector search
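
The core operation behind the tools in this section is nearest-neighbor search over embedding vectors. A minimal sketch of that idea in plain Python, assuming tiny hand-written 3-dimensional vectors in place of real embeddings; the names `cosine_similarity`, `top_k`, and the document IDs are hypothetical, and a real vector database would use approximate indexes rather than this brute-force scan:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def top_k(query, index, k=2):
    """Brute-force search: score every stored vector, return the k best."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy "embeddings" (assumption: real ones come from an embedding model).
index = {
    "doc_cats": [0.9, 0.1, 0.0],
    "doc_dogs": [0.8, 0.3, 0.1],
    "doc_tax": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]
results = top_k(query, index, k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

In a RAG setup, the text of the top-ranked documents would then be passed to the LLM as context.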

Automation & Workflow
RPA (Robotic Process Automation):
- UiPath
- Automation Anywhere
- Blue Prism
n8n → open-source Zapier alternative
Zapier / Make (Integromat) → SaaS automation
Apache Airflow → data pipelines, scheduling
Prefect → modern data workflow orchestration
Temporal → long-running workflows, reliability
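
Pipeline orchestrators such as Airflow model a workflow as a DAG of tasks and run each task only after its upstream tasks finish. A minimal sketch of that ordering idea using only the Python standard library (`graphlib`, Python 3.9+); `run_pipeline` and the three task names are hypothetical, and this is not the API of Airflow, Prefect, or Temporal:

```python
from graphlib import TopologicalSorter


def run_pipeline(tasks, deps):
    """Run tasks in dependency order.

    tasks: task name -> zero-argument callable
    deps:  task name -> set of task names that must run first
    """
    order = list(TopologicalSorter(deps).static_order())
    log = []
    for name in order:
        log.append(tasks[name]())
    return order, log


# Hypothetical three-step ETL pipeline.
tasks = {
    "extract": lambda: "extracted",
    "transform": lambda: "transformed",
    "load": lambda: "loaded",
}
deps = {"transform": {"extract"}, "load": {"transform"}}

order, log = run_pipeline(tasks, deps)
print(order)
```

Real orchestrators add scheduling, retries, and persisted state on top of this ordering step.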

MLOps / AI Ops
(Model lifecycle & monitoring)
Weights & Biases (W&B)
experiment tracking, model versioning, dashboards
MLflow
open-source lifecycle management
Kubeflow
Kubernetes-native ML platform
LangSmith (by LangChain)
debugging, tracing, evaluation for LLM apps
Arize AI
LLM observability & monitoring
Evidently AI
data & model quality monitoring

Infrastructure & Deployment
Cloud AI Platforms:
- AWS Bedrock (hosted foundation models)
- Azure OpenAI / Cognitive Services
- Google Vertex AI
- IBM watsonx.ai
On-device / Local Inference:
- Ollama
- LM Studio
- vLLM (optimized inference engine)
- TensorRT-LLM (NVIDIA GPU optimization)
Distributed/Scalable Serving:
- Ray Serve
- Hugging Face TGI (Text Generation Inference)
Containerization & Orchestration:
- Kubernetes
- Docker

Supporting AI Tools
Prompt Engineering:
- Techniques (few-shot, chain-of-thought, self-consistency, retrieval injection)
- Tools: Promptfoo, Guidance, DSPy
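
Two of the techniques listed above, few-shot prompting and self-consistency, can be sketched in a few lines. This is an illustration under stated assumptions: `build_few_shot_prompt`, `self_consistent_answer`, and `make_fake_sampler` are hypothetical helpers, and the sampler returns canned strings in place of real LLM calls made at non-zero temperature:

```python
from collections import Counter


def build_few_shot_prompt(examples, question):
    """Few-shot: prepend worked Q/A pairs so the model can imitate the format."""
    parts = [f"Q: {q}\nA: {a}" for q, a in examples]
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)


def self_consistent_answer(sample_fn, prompt, n=5):
    """Self-consistency: sample n answers and keep the most common one."""
    answers = [sample_fn(prompt) for _ in range(n)]
    return Counter(answers).most_common(1)[0][0]


def make_fake_sampler(canned):
    """Stand-in for an LLM call (assumption): yields canned answers in order."""
    it = iter(canned)

    def sample(prompt):
        return next(it)

    return sample


examples = [("2 + 2?", "4"), ("3 + 5?", "8")]
prompt = build_few_shot_prompt(examples, "6 + 7?")
sampler = make_fake_sampler(["13", "12", "13", "13", "14"])
answer = self_consistent_answer(sampler, prompt, n=5)
print(prompt)
print("majority answer:", answer)
```

With a real model, `sample_fn` would send the prompt to the LLM and extract only the final answer before voting.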
Evaluation Frameworks:
- HELM (Holistic Evaluation of Language Models)
- MMLU, GSM8K (benchmarks)
- TruLens, Ragas (RAG evaluation)
Data Labeling & Prep:
- Label Studio
- Snorkel
- Scale AI
Agents & Tooling:
- Hugging Face Agents
- AutoGPT, CrewAI, LangGraph

Business Integration (Enterprise AI)
CRM/ERP AI:
- Salesforce Einstein AI
- SAP Business Technology Platform (BTP) AI
- Oracle AI in Fusion Cloud
BI + Analytics AI:
- Microsoft Power BI Copilot
- Tableau GPT
- Qlik Sense + AI
APIs for Custom Integration:
- FastAPI
- Flask
- gRPC
Enterprise Platforms:
- ServiceNow AI
- Notion AI
- Slack GPT
