Job Title: Data Scientist – Extraction and Automation
Location: Washington, DC (Hybrid)
Department: Development
As a Data Scientist, you will focus on developing and assessing machine learning and large language models to provide accessible, relevant, high performing and cost-sensitive solutions to our Data Extraction, Automation, eDiscovery, Cyber Incident Response and Data Governance clients.
You will also play a critical role in designing, deploying, and orchestrating AI Agents—autonomous and semi-autonomous systems built on large language models and task-specific workflows—to handle complex data challenges. These agents will support tasks such as document review automation, legal classification, privilege detection, and data summarization.
Additionally, you will work closely with cross-functional teams using the latest GenAI tools to build agents that would extract intelligence and automate actions like identify and redact PXI and other sensitive data, legal privilege, legal relevance, and a variety of other tasks for legal professionals.
Responsibilities:
• Own the full data science lifecycle, from data preparation, development/modeling, to evaluation.
• Design, implement, and deploy scalable data pipelines and AI/ML models in production environments.
• Collaborate with cross-functional teams – including Product Managers, Subject Matter Experts, Developers, etc., – to understand customer requirements and optimize existing products.
• Build compelling, innovative new experiences that provide actionable, cost-effective and thoughtful solutions for our customers.
• Participate in research, case studies, and development to explore and leverage cutting-edge technologies.
• Apply techniques from statistics, signal processing, natural language processing, machine learning, generative AI, and other areas to develop new algorithms and power advanced features in our data management platform.
• Make improvements to how we validate, test, and deploy data science algorithms and models.
• Measure the impact of the experiences the team’s building.
Required Qualifications:
• Bachelor’s degree in Statistics, Computer Science, Mathematics, or other related discipline. Masters’ degree is a plus.
• Strong proficiency in coding, data structures, and algorithms.
• Comprehensive understanding of statistics, optimization, and machine learning.
• Excellent problem-solving skills and ability to work in a fast-paced environment.
• Strong ability to take an obscurely defined task and break it down into clear, actionable steps.
• Solid knowledge of machine learning algorithms and advanced statistical concepts and methods.
• Able to pass a Public Trust Clearance, at minimum.
Preferred qualifications:
• Experience with Data Extraction, Automaton, eDiscovery, Cyber Data Mining, Data Governance or other aspects of advanced technology in the legal sphere.
• Experience at a legal service provider, law firm or other legal-centric employer working with ML, NLP, LLM’s, or other advanced technologies.
• Exposure to Federal government projects.
• Experience working on a workflow orchestration product, extensible software platform and/or software development kit (SDK) development
• Strong familiarity with machine learning methods and technology
Why Join Us?
Impact: Play a critical role in shaping the future of iCONECT solutions and making a difference in building initiatives.
Innovation: Work in a fast-paced, innovative environment with cutting-edge technology and a collaborative team.
Growth Opportunities: We prioritize professional development and provide opportunities for growth within the organization.
Culture: Join a diverse and inclusive workplace that values creativity, integrity, and teamwork.