GenAI (Core)
- Python-first implementation: Build LLM/RAG services in Python (FastAPI/asyncio, Pydantic), with clean APIs and tests.
- Agentic AI workflows: Tool-using agents (planning, memory, multi-step execution), function/tool calling, and recovery/fallback paths.
- Streaming responses: Server-streamed tokens from the model to the UI.
- RAG pipelines: Ingestion, chunking, embeddings, indexing, reranking; grounded answers with citations (pgvector/FAISS/Pinecone).
- Prompt management: Versioning, templates/params, safe fallbacks and rollback.
- Evaluations: Hallucination/groundedness checks, small regression suite for prompts/retrievers.
- Guardrails: PII detection/redaction, content safety, domain constraints.
- Perf & cost: Context reduction, caching/batching, request pacing/rate-limit handling.
- Multimodal (basic): Text + files; ASR/TTS as a plus.
- Telemetry: Token usage, grounding rate, answer length, latency, errors/timeouts.
Integration (Core)
- API layer: Design/ship REST (or gRPC) endpoints that orchestrate tools, retrieval, and post-processing (citations/formatting).
- Streaming UX: SSE/WebSockets with retries, backoff, partial responses, and user cancellation.
- MCP (Model Context Protocol): Implement/consume MCP tool adapters (files, web, DB/connectors), capability negotiation, auth/permissions, and resource limits for safe tool access.
- Vector stores & data sources: Integrate pgvector/FAISS/Pinecone plus DB/file stores and enterprise connectors (e.g., SharePoint/Confluence/Slack/Jira).
- AuthN/Z & tenancy: JWT/OAuth, role/tenant isolation, secrets management, audit logging for AI actions.
- Observability: Logs, metrics, traces; dashboards/alerts for latency, error rate, token spend.
- Queues/workflows: Background jobs for ingestion/summarization with idempotency and retries.
- Data governance: Input/output validation, schema contracts, PII handling, retention policies.
- CI/CD: For prompts, retrieval configs, and API changes; blue/green or canary releases.
Job Category: AI/ML
Job Type: Full Time
Job Location: Bangalore