We are looking for a skilled Python Data Engineer to design and build scalable, real-time data systems that power high-performance applications. This role is ideal for engineers passionate about asynchronous Python, data pipelines, and working with NoSQL databases. You'll be part of a collaborative, cross-functional team working on advanced backend systems and data-driven APIs.
Key Responsibilities:
• Python: Proficient in modern Python development, with a strong understanding of asynchronous programming using async/await, coroutines, and event loops.
• Tornado Framework: Experienced in building and maintaining high-performance, non-blocking REST APIs using Tornado, with a focus on scalability and low-latency design.
• MongoDB & NoSQL: Skilled in data modeling, schema design, and query optimization for MongoDB; hands-on experience with Motor, the async MongoDB driver.
• API Design & Integration: Strong grasp of RESTful principles, endpoint structuring, and efficient handling of large or streaming data payloads.
• Data Pipelines: Familiar with ingesting, transforming, and validating structured and unstructured data for ETL and real-time workflows.
• Code Architecture: Skilled in applying clean code principles to build modular, maintainable architecture using service layers, repository patterns, and separation of concerns.
• Validation & Data Integrity: Experienced in implementing schema validation and data consistency checks in dynamic and schema-less environments.
• Testing & Quality Assurance: Proficient in writing unit and integration tests for async code using tools like pytest and pytest-asyncio.
• DevOps Awareness: Comfortable working with Git, Docker, and CI/CD practices for deploying Tornado applications in containerized environments.
• Monitoring & Reliability: Experienced in implementing logging, error handling, and performance monitoring for robust and observable async services.
• Collaboration & Communication: Experienced in working cross-functionally with data scientists, backend teams, and product stakeholders to deliver data-driven features and APIs.
• Command Line & macOS Proficiency: Familiarity with macOS environments and strong command-line skills are appreciated, especially for development, tooling, and scripting workflows.
• AI & Agent-Based Preferences (Nice to Have): Experience working with AI-generated code and agent-to-agent communication frameworks, and with integrating or replacing parts of application logic using LLM-driven or autonomous agent orchestration.
Required Skills:
• 5+ years of experience in Python development, with a strong focus on asynchronous architecture.
• Hands-on experience with the Tornado framework or similar async web frameworks (FastAPI, aiohttp).
• Strong understanding of NoSQL and document databases, especially MongoDB.
• Deep knowledge of RESTful API design and microservices, and experience working with large or streaming data sets.
• Familiarity with Docker-based containerized environments, Git workflows, and CI/CD tools.
• Excellent debugging, problem-solving, and communication skills.
• Proven ability to work in cross-functional teams and deliver production-grade data systems.
AWS, Python, Data Pipeline, MongoDB, AI, Command Line & macOS, Asynchronous architecture, Tornado framework, Async web frameworks (FastAPI, aiohttp), NoSQL databases, RESTful API design, Microservices, Large data sets, Streaming data, Docker, Git, CI/CD tools, Debugging, Problem-solving, Communication skills, Cross-functional teamwork, Production-grade data systems.