Azure Databricks Data Engineer
Brak informacji o wynagrodzeniu
MidFull-time·Umowa o pracę
#321386·Dodano 28 dni temu·24
Źródło: theprotocol.itTech Stack / Keywords
DatabricksDelta LakeSparkSQLPythonMLflowPower BI
Firma i stanowisko
Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided everyday by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 360,000 team members globally in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms.
Wymagania
- Hands-on experience designing and developing data pipelines with Databricks.
- Strong working knowledge of Delta Lake, Spark, and Databricks notebooks.
- Proficiency in SQL and Python for data transformation and analysis.
- Familiarity with data modeling, data warehousing, and data architecture principles.
- Understanding of data governance and security frameworks (experience with Unity Catalog is a plus).
- Experience with tools such as MLflow, Power BI, or similar technologies.
- Solid problem-solving and troubleshooting skills for data and pipeline issues.
- Effective communicator and collaborator in cross-functional environments.
- Bachelor’s degree in Computer Science, Information Systems, Engineering, or a related field.
- Databricks certification (e.g., Databricks Certified Data Engineer Associate or Professional) is an advantage.
Obowiązki
- Design, develop, and maintain ETL/ELT pipelines and workflows using Databricks.
- Collaborate with cross-functional teams to understand data requirements and deliver high-quality datasets.
- Build and manage data lake and data warehouse architectures to support analytics and reporting.
- Optimize data workflows for performance, scalability, and cost efficiency.
- Ensure data quality, consistency, and reliability across all data platforms.
- Apply data governance and security standards, leveraging tools such as Unity Catalog.
- Integrate Databricks with enterprise systems and tools (e.g., Delta Lake, MLflow, Power BI, Mosaik SDK, Agent Bricks).
- Monitor, test, and troubleshoot data pipelines to ensure production stability and performance.
- Contribute to best practices in data engineering and participate in code reviews.
Oferta
- Medical care with Medicover.
- Private life insurance.
- Sports card.
- Capgemini Helpline offering therapeutical support.
- Educational podcast "Let's talk about wellbeing".
- Access to over 70 training tracks with certification opportunities on NEXT platform.
- Free access to Education First languages platform, Pluralsight, TED Talks, Coursera, and Udemy Business materials and trainings.
- Continuous feedback and ongoing performance discussions via GetSuccess.
- Hybrid working model with home office package (laptop, monitor, chair).
- Sharing the costs of sports activities.
- No dress code.
- Parking space for employees.
- Extra social benefits.
- Redeployment package.
- Employee referral program.
- Charity initiatives.
- Access to courses (e.g., Excel, VBA, RPA, Customer Care).
- Unlimited access to Udemy Business.
- Free chat/call with a therapist.
Karta sportowa
Opieka zdrowotna
Ubezpieczenie
Parking dla aut
Dofinansowanie szkoleń
Płatny urlop
Bonusy
Capgemini Polska Sp. z o.o.
133 aktywne oferty