£600 - £675 per day
Duration: 6 Months
My client is building out a data catalogue for Research & Development teams. The data catalogue will: be a registry of internal and external data products, digitise access governance and automate access provision. As such the Data Catalogue will be foundational in making our data products: findable, accessible and re-usable.
The aim is to create a scalable solution to support the development of data communities and valuable data products. To make this solution scalable we need to provide the services and tools to our partners in the R&D in a secure, compliant, stable, sustainable way.
As the Data Catalogue Lead you will lead the build and run of the data catalogue as a capability for R&D and IT.
Collibra is the data catalogue technology. Existing teams will be making use a range of data engineering products to acquire, ingest and curate metadata into the data catalogue (including Talend and AWS Glue). This will include implementing metadata models, building governance workflows, automating granting of access and building out APIs.
Essential skills and experience
*You will have experience of building or developing team,
*Technical leadership in a data domain,
*You will be able to demonstrate an ability to understand business needs and translate them into a solution,
*You will be able to design and document development best practices,
*You will need great interpersonal skills & a collaborative approach to delivery.
Desirable skills and experience
*It is highly desirable that you have experience developing and managing a data catalogue or similar,
*Experience configuring and managing a SaaS system,
*A highly available system,
*Metadata best practices and design principles,
*Legal issues surrounding data re-use, especially in a pharmaceutical organisation (e.g. PII, GxP, primary & secondary use of data),
*Experience of big data, ETL & cloud techniques and tools (we currently use Talend. Redshift (inc. Spectrum), Glue, EMR, HIVE, PIG, Spark, S3, SQS, SNS),
*You have experience of technical leadership in data and analytics,
*Building and maintaining APIs over data services,
*Experience working with systems integrators