Senior Data Scientist – Generative AI
🏢 Company: Wellhub
📍 Location: Portugal
🌍 Remote: Yes
💼 Type: Full-time
💰 Salary: Not specified
📋 Overview
Remote Senior Data Scientist role at Wellhub focused on GenAI, multimodal extraction, and unstructured data processing in Portugal.
📝 Job Description
About Wellhub
Wellhub is a leading corporate wellness platform connecting employees to fitness, mindfulness, nutrition, and therapy partners through a unified subscription. Based in NYC with a global team, we prioritize wellbeing. Recent rebranding reflects our evolution from a gym pass provider to a comprehensive wellness solution.
Role Impact
As a Senior Data Scientist in the Generative AI team, you’ll leverage cutting-edge AI to transform unstructured content:
- Apply advanced tools (OCR, vision-language models, document understanding) for data extraction
- Develop prompt engineering strategies for LLM-powered content structuring
- Create systems for data quality assurance, bias reduction, and ethical compliance
- Design human-in-the-loop feedback mechanisms
- Architect scalable ML pipelines
Collaborate cross-functionally while promoting wellbeing principles at work.
Your Impact
You’ll develop and deploy GenAI solutions for:
- Multimodal data extraction across diverse formats
- LLM-enhanced content transformation and summarization
- Automated data validation and filtering
- Data architecture for ethical ML/extract processes
- Master’s (or PhD) in CS, Data Science, ML, Statistics or related field
- Proven Python expertise with NLP/OCR toolkits
- Demonstrated experience in LLM prompt engineering
- Track record of unstructured data processing expertise
- Gold-level Wellhub subscription
- Additional fitness subsidies
- Flexible hybrid/remote work models
- Paid time off + parental leave benefits
- Growth opportunities in a supportive, global tech environment
- Master’s or PhD degree in Computer Science, Data Science, Machine Learning, Statistics, or relevant engineering discipline.
- Strong proficiency in Python programming with demonstrated experience using libraries for NLP, data extraction, and OCR.
- Deep understanding of LLM mechanisms within multimodal contexts, including principles of prompt engineering and few-shot learning.
- Proven record of processing and structuring unstructured data from diverse sources using methods like web scraping, OCR, API integration, and document analysis.
- Excellence in analytical problem-solving, systematically transforming noisy datasets into high-quality, usable formats for machine learning.
- Exceptional written and verbal communication skills to effectively collaborate and influence technical and product stakeholders.
- Evidence of deploying or supporting data systems in production environments (preferred).
- Comprehensive health and wellness benefits including free access to premium Wellhub plans.
- Generous paid time off policy (minimum 25 days/year plus anniversary days)
- Fully remote or flexible hybrid work options
- Competitive parental leave program (100% paid)
- Home office stipend
- Monthly flexible work allowance
- Professional development opportunities and career growth support
Qualifications:
Preferred: Practical experience with VLMs (BLIP, LayoutLM etc.), production-deployed data pipelines.
Benefits
✅ Qualifications
🎁 Benefits
Direct application link to company career page
🔖 Tags: None
📱 Follow us: Telegram | LinkedIn
Source: Curated from public job boards. Verify details on the employer site before applying.