Lead Python Developer
We have a great skill set in big data technologies, cloud computing, and scalable data solutions, and we would love to speak with you if you have expertise in Python, Apache Spark, SQL, and cloud platforms like AWS.
About You – experience, education, skills, and accomplishments- 5+ years of hands-on experience in Python software development.
- Strong expertise in Apache Spark and PySpark for scalable data solutions.
- Advanced proficiency in SQL, database design, and relational database systems.
- Experience developing and deploying solutions on cloud platforms (AWS – S3, ECS, EMR, Glue, RDS).
- Familiarity with orchestration tools like Apache Airflow.
- Understanding of modern data lake architectures and familiarity with data storage formats such as Delta Lake and Apache Iceberg.
- Solid grasp of CI/CD principles and experience implementing automated pipelines.
- Bachelor’s degree in computer science, Software Engineering, or a related field or equivalent relevant experience.
- Experience with Databricks and Snowflake.
- Exposure to Cassandra, Hadoop, and Apache Hive.
- Experience using Jupyter notebooks and BI tools like Tableau or Power BI.
- Familiarity with PL/SQL and Oracle GoldenGate.
- Master’s degree or Ph.D. in Computer Science or a related discipline.
- Providing technical leadership – Guide the development team in implementing Python-based solutions, ensuring alignment with business requirements and scalability best practices.
- Developing applications – Design, build, and maintain scalable Python applications for data processing, transformation, and analytics.
- Big data integration – Leverage Apache Spark, PySpark, and related big data technologies to develop distributed data processing pipelines.
- Database management – Utilize SQL expertise to design efficient data models, optimize queries, and ensure data quality.
- Enforcing best practices – Implement and uphold software development best practices, including testing and CI/CD pipelines.
- Continuous learning – Stay updated with evolving Python and big data technologies, recommending improvements to enhance the data platform.
- Mentorship & collaboration – Mentor junior developers and foster a culture of continuous learning within the team.
Product you will be developing
You will be working on high-performance data engineering solutions that support the Life Sciences industry, helping to accelerate drug discovery and commercialization. The platform leverages technologies such as Python, Apache Spark, PySpark, SQL, AWS, Apache Airflow, and modern data lake architectures.Our solutions focus on scalability, efficiency, and data integrity, ensuring businesses can make data-driven decisions with confidence.
About the Team
You will be joining a collaborative, high-performing team focused on building innovative data solutions. The team consists of data engineers, software developers, and cloud specialists, working closely with data scientists, product managers, and external stakeholders.We thrive on agility, continuous improvement, and knowledge sharing to drive excellence in data engineering.
Hours of Work
This is a Full-Time permanent role, in Barcelona – Spain (40 working hours per week)
Hybrid working model, 2-3 days per week from our Office, near Sagrada Familia, central region in the city.
What we can offer you:
- An opportunity to have a real impact on the global IP industry.
- 30 working days of vacation
- Volunteering community, with 40 paid hours of volunteering time
- Private Health and Life & Disability insurances.
- Tax-free benefits (Ticket Restaurant scheme, kindergarten, and transport cards).
At Clarivate, we are committed to providing equal employment opportunities for all persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.