:Responsibilities
-
Prepare and preprocess data for AI models, develop features, and collaborate with R&D and Product teams to ensure the effective deployment and monitoring of machine learning models
-
Build and maintain scalable ML data pipelines, ensuring seamless integration with AI workflows
-
Analyze and integrate data from various sources, ensuring high-quality data for use in models
-
Optimize data processing systems for performance and cost-efficiency, automating routine data handling tasks
-
Collaborate closely with data team to document data processes and infrastructure, providing training and support when needed
-
Monitor the performance of deployed models, analyze outputs, and recommend data-driven improvements
-
Ensure data security, compliance with regulations, and implementation of data governance policies
Requirements:
-
Bachelor's degree in Computer Science, Data Science, Mathematics, or a related field
-
Basic understanding of data science concepts and workflows, including data preprocessing and feature engineering
-
Proficiency with programming languages commonly used in data science, such as Python or R
-
Familiarity with database systems (SQL, NoSQL) and data integration concepts
-
Knowledge of machine learning frameworks and libraries (e.g., TensorFlow, Scikit-learn, PyTorch) is an advantage
-
Experience or knowledge of ETL tools and pipeline design is an added bonus
-
Strong analytical and problem-solving skills
-
Excellent communication and teamwork abilities
-
Eagerness to learn and adapt to new technologies and methodologies