Data Scientist

Contentstack India Pvt Ltd

Posted: about 2 years ago

Company Website
https://cutshort.io/jo...
Position type
full time
Job source
Cutshort
Category
programming
Remote
No
Salary
---
Job location
India
About

What is Contentstack?

Contentstack combines the best Content Management System (CMS) and Digital Experience Platform (DXP) technology. It enables enterprises to manage content across all digital channels and create inimitable digital experiences. The Contentstack platform was designed from the ground up for large-scale, complex, and mission-critical deployments. Recently recognized as the Gartner PeerInsights Customers' Choice for WCM, Contentstack is the preferred API-first, headless CMS for enterprises across the globe. 

 

What Are We Looking For?

Contentstack is looking for a Data Scientist.

 

Role & Responsibilities:

  1. Analyze raw data, create and maintain optimal data pipeline architecture to improve data reliability and quality.
  2. Deep knowledge and hands-on experience in technologies across all data lifecycle stages.
  3. Experience and deep knowledge about Content Management Systems would be highly preferred.
  4. Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources.
  5. Work closely with the Data Scientists and provide them with quality data for advanced analytics.
  6. Experience building and optimizing data pipelines, architectures and data sets.
  7. Strong analytical skills related to working with NoSQL databases and unstructured datasets.


Technical Requirements:

  1. At least 4+ years of experience as a data scientist with expertise on NLP and Deep Learning.
  2. Good knowledge on ETL and ELT tools and pipelines, Data APIs.
  3. Technical expertise with data models, data mining, and segmentation techniques
  4. Experience supporting, maintaining and monitoring data pipelines (real time & batch)
  5. Knowledge of programming languages (Python or Java).
  6. Experience with relational SQL and NoSQL databases (MongoDB).
  7. Experience with big data tools: Hadoop, Spark, Kafka, etc.
  8. Degree in Computer Science, IT, or similar field. Master’s degree is a plus
  9. Demonstrated ability to develop advanced CI/CD pipelines in GitLab, Python, Apache Airflow (e.g. using dynamic data pipelines)

  10. Experience with one or more of MLOps tools: ModelDB, Kubeflow, Pachyderm, and Data Version Control (DVC) etc.

  11. Experience in Docker, Kubernetes, Jenkins, Spark (Highly preferred)

Skills:- CICD

Subscribe to our daily job alerts

Sign up for our newsletter to stay up to date with new jobs posted on Profilehunt

Please confirm your email address once you subscribe.