We are looking for a skilled Python Data Engineer to design and build scalable, real-time data systems that power high-performance applications. This role is ideal for engineers passionate about asynchronous Python, data pipelines, and working with NoSQL databases. You'll be part of a collaborative, cross-functional team working on advanced backend systems and data-driven APIs.
Key Responsibilities:
Python: Proficient in modern Python development with strong understanding of asynchronous programming using async/await, coroutines, and event loops.
Tornado Framework: Experienced in building and maintaining high-performance, non-blocking REST APIs using Tornado, with a focus on scalability and low-latency design.
MongoDB & NoSQL: Skilled in data modeling, schema design, and query optimization for MongoDB; hands-on experience with Motor, the async MongoDB driver.
API Design & Integration: Strong grasp of RESTful principles, endpoint structuring, and efficient handling of large or streaming data payloads.
Data Pipelines: Familiar with ingesting, transforming, and validating structured and unstructured data for ETL and real-time workflows.
Code Architecture: Applies clean code principles with a focus on modular, maintainable architecture using service layers, repository patterns, and separation of concerns.
Validation & Data Integrity: Implements schema validation and data consistency checks in dynamic and schema-less environments.
Testing & Quality Assurance: Proficient in writing unit and integration tests for async code using tools like pytest and pytest-asyncio.
DevOps Awareness: Comfortable working with Git, Docker, and CI/CD practices for deploying Tornado applications in containerized environments.
Monitoring & Reliability: Implements logging, error handling, and performance monitoring for robust and observable async services.
Collaboration & Communication: Experienced in working cross-functionally with data scientists, backend teams, and product stakeholders to deliver data-driven features and APIs.
Command Line & macOS Proficiency: Familiarity with macOS environments and strong command-line skills are appreciated, especially for development, tooling, and scripting workflows.
AI & Agent-Based Preferences (Nice to Have): Experience with AI-generated code, agent-to-agent communication frameworks, and integrating or replacing parts of logic through LLM-driven or autonomous agent orchestration.
Required Skills:
5+ years of experience in Python development, with a strong focus on asynchronous architecture.
Hands-on experience with the Tornado framework or similar async web frameworks (FastAPI, aiohttp).
Strong understanding of NoSQL and document databases, especially MongoDB.
Deep knowledge of RESTful API design, microservices, and working with large or streaming data sets.
Familiarity with containerized environments using Docker, Git workflows, and CI/CD tools.
Excellent debugging, problem-solving, and communication skills.
Proven ability to work in cross-functional teams and deliver production-grade data systems.
Python, Data Pipeline, Mongo DB, AI, Command Line & macOS