We are seeking a GenAI Engineer to design, develop, and deploy Generative AI solutions that enhance business workflows and user experiences. The ideal candidate will have strong expertise in LLMs (Large Language Models), prompt engineering, and integration of AI services into scalable applications.
Key Responsibilities
Model Integration: Implement/fine-tune LLMs; build APIs/microservices for GenAI features.
Prompt Engineering: Design, optimize, and evaluate prompts for safety and accuracy.
RAG (Retrieval-Augmented Generation): Develop pipelines for document ingestion, vector embeddings, and semantic search.
App Dev: Integrate GenAI into web/mobile apps using FastAPI, Streamlit, or React.
Optimization: Monitor token usage, latency, and inference costs.
Safety: Implement moderation, bias detection, and responsible AI guidelines.
Required Skills
Python (FastAPI, Flask, Django), LLM APIs (OpenAI, Azure), Vector DBs (Pinecone, Weaviate, FAISS).
Cloud (AWS/Azure/GCP), Docker/K8s, ML fundamentals (embeddings, tokenization).
Real-time AI (SSE/WebSockets).
Preferred Skills
LangChain, LlamaIndex, Image models (Stable Diffusion), MLOps, CI/CD.
Technical Deep-Dive: Vector Embeddings
Since the JD specifically asks for knowledge of embeddings and vector databases, your engineers should be prepared to answer the following:
Python, FastAPI, Flask, Django, Llm Apis, Openai, Azure Openai, Prompt Engineering, Retrieval Augmented Generation, Vector Embeddings, Vector Databases, Pinecone, Weaviate, Faiss, Semantic Search, Cloud Computing, Aws, Azure, Gcp, Docker, Kubernetes, Microservices, Api Development, Ml Fundamentals, Tokenization, Real Time Ai, Server Sent Events, Websockets, Langchain, Llamaindex, MLOps, Ci Cd, Genai Integration, Application Development, Model Monitoring, Cost Optimization, Ai Safety, Bias Detection