What We’re Looking For
- Strong understanding of statistical analysis, hypothesis testing, and pattern recognition
- Strong understanding of data modelling concepts and the ability to design effective data models.
- Proficiency in data engineering tools and languages (e.g., Python, SQL, Apache Spark).
- Hands-on experience fine-tuning and deploying LLMs (OpenAI, Anthropic, open-source models)
- Solid understanding of NLP and prompt engineering
- Experience with data analysis and statistical methods for extracting actionable insights
- Experience building data pipelines and ETL workflows (e.g., Airflow, Prefect, or similar)
- Familiarity with MLOps practices (model versioning, monitoring, deployment)
- Comfortable working in an agile environment
- Curious, reliable, and self-driven engineer
- Proficiency in English, both written and verbal
Nice to Have
- Experience with data visualization tools (Plotly, Matplotlib, Tableau) for client reporting
- Experience with LangChain, LlamaIndex, or similar AI orchestration frameworks
- Background in healthcare data or working with sensitive/regulated data
- Familiarity with cloud ML platforms (GCP Vertex AI, AWS SageMaker, Azure ML)
- Knowledge of SQL and relational databases
- Understanding of API design for ML services
- Experience building and fine-tuning proprietary LLMs for specific use cases and optimizing their performance
- Experience with Docker and containerized ML applications