Lead - Data Engineer
Apply now »Posted On: 12 Feb 2026
Location: Noida, UP, India
Company: Iris Software
Are you ready to do the best work of your career at one of India’s Top 25 Best Workplaces in IT industry? Do you want to grow in an award-winning culture that truly values your talent and ambitions?
Join Iris Software — one of the fastest-growing IT services companies — where you own and shape your success story.
At Iris Software, our vision is to be our client’s most trusted technology partner, and the first choice for the industry’s top professionals to realize their full potential.
At Iris, every role is more than a job — it’s a launchpad for growth.
Job Description
• The Lead Data Engineer is a strategic and technical leadership role responsible for architecting, scaling, and evolving enterprise-grade data platforms that enable advanced analytics, AI/ML, and data-driven decision-making. Reporting to the Senior Director of Data Platforms, this role will lead the design and governance of modern data architectures, drive innovation in AI orchestration, and ensure the delivery of secure, compliant, and high-performing data solutions.
• This position combines hands-on engineering expertise with architectural vision and cross-functional leadership. The Lead Data Engineer will guide engineering teams, influence platform strategy, and establish best practices across the organization’s data ecosystem.
Basic Qualifications :
• Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or related field.
• 8+ years of experience in data engineering and architecture, with a proven track record of leading large-scale data initiatives.
• Deep expertise in Python, PySpark.
• Strong hands-on experience with Databricks (Spark, Delta Lake, Workflows)
• Strong experience with AWS (S3, IAM, Textract, Bedrock or equivalent)
• Experience with design and implement scalable document ingestion pipelines using Databricks Auto Loader and AWS S3.
• Understanding of vector embeddings and semantic search
• Strong understanding of data governance, privacy, and compliance in regulated industries (healthcare, life sciences).
Good to Have :
• Advanced knowledge of data modeling, lakehouse/lake/warehouse design, and performance optimization.
• Familiarity with generative AI platforms and use cases.
• Contributions to open-source projects or thought leadership in data engineering/architecture.
• Experience with Agile methodologies, CI/CD, and DevOps practices.
• Exposure to FastAPI, or API-based ML services
• Experience evaluating LLM output quality
Key Responsibilities :
• Lead Engineering Teams: Provide technical leadership and mentorship to data engineers, fostering a culture of excellence, innovation, and continuous improvement.
• AI/ML Enablement: Collaborate with Data Science and ML Engineering teams to operationalize models, implement AI orchestration frameworks (e.g., MLflow, Airflow), and ensure scalable deployment pipelines.
• Platform Strategy & Governance: Define and enforce architectural standards, data governance policies, and compliance frameworks (HIPAA, SOC 2, GDPR, etc.) across the data platform.
• Performance & Reliability Optimization: Drive observability, automation, and performance tuning across data pipelines and infrastructure to ensure reliability at scale.
• Cross-Functional Collaboration: Partner with product, analytics, compliance, and infrastructure teams to align data architecture with business goals and regulatory requirements.
• Innovation & Thought Leadership: Stay ahead of industry trends, evaluate emerging technologies, and contribute to strategic decisions on platform evolution, including generative AI integration and event-driven systems.
Mandatory Competencies
Perks and Benefits for Irisians
Iris provides world-class benefits for a personalized employee experience. These benefits are designed to support financial, health and well-being needs of Irisians for a holistic professional and personal growth. Click here to view the benefits.