About the role
Babel Street is the trusted technology partner for the world’s most advanced identity intelligence and risk operations. We deliver advanced AI and data analytics solutions providing unmatched, analysis-ready data regardless of language, proactive risk identification, 360-degree insights, high-speed automation, and seamless integration into existing systems. Babel Street empowers government and commercial organizations to transform high-stakes identity and risk operations into a strategic advantage. The actionable insights we deliver safeguard lives and protect critical assets around the world. Babel Street is headquartered in Reston, Virginia, with regional offices in Boston, MA and Cleveland, OH, and international offices in Australia, Canada, Israel, Japan, and the U.K. For more information, visit www.babelstreet.com.
ROLE SUMMARY:
As an Engineer on the Image & Computer Vision AI team, you will play a hands-on role in developing and deploying computer vision capabilities that support Babel Street’s intelligence applications. You will build systems that extract, analyze, and reason over visual data—enabling facial matching, object and scene understanding, geolocation and location inference from imagery, and multimodal intelligence workflows.
This role is execution-focused and suited for engineers with strong foundations in computer vision, image processing, and machine learning who want to apply their skills to real-world, mission-driven problems. You will work closely with AI, Product, and Engineering teams to deliver reliable, scalable, and cost-efficient vision capabilities, including integration with multimodal LLM systems that allow users to search and reason over images using natural language.
This is a hybrid role based in either our Reston, VA/Washington, DC office or our Somerville, MA office.
ROLE FOCUS:
This role spans three practical execution areas:
Computer Vision & Image Analytics
You will implement and operate image analytics pipelines that support facial matching, object detection, scene understanding, and image similarity. This includes image preprocessing, feature extraction, model inference, evaluation, and performance optimization to meet mission-grade accuracy and latency requirements.
Geospatial & Location Inference from Imagery
You will contribute to capabilities that infer location, context, or environmental attributes from imagery—leveraging visual cues, metadata, and learned representations. This includes supporting image-based geolocation, landmark recognition, and contextual scene analysis used in intelligence workflows.
Multimodal AI & Image Search
You will support multimodal AI systems that combine vision models with LLMs, embeddings, and retrieval pipelines to enable natural-language search and reasoning over images and image collections. You will help integrate visual understanding into broader intelligence applications and workflows.
KEY RESPONSIBILITIES:
- Build and maintain computer vision pipelines for image ingestion, preprocessing, inference, and evaluation.
- Implement facial matching and identity-related vision workflows in accordance with accuracy, safety, and compliance requirements.
- Develop and support object detection, image similarity, and scene understanding models.
- Contribute to image-based geolocation and location inference capabilities using visual features and contextual signals.
- Support multimodal AI workflows that combine image embeddings with LLM-based search and reasoning.
- Write clean, maintainable Python code and contribute to production services and APIs.
- Assist with model evaluation, bias testing, and accuracy monitoring for vision systems.
- Optimize inference pipelines for performance, scalability, and cost efficiency (GPU usage, batching, model selection).
- Collaborate with Product and Engineering teams to integrate vision capabilities into user-facing intelligence applications.
QUALIFICATIONS:
Required
- 3+ years of experience in computer vision, image processing, or applied machine learning.
- Hands-on experience with computer vision models and techniques (e.g., CNNs, vision transformers, feature embeddings).
- Experience building or integrating image analytics such as facial recognition, object detection, or image similarity.
- Strong programming skills in Python; experience with common CV/ML libraries (PyTorch, TensorFlow, OpenCV, etc.).
- Solid understanding of machine learning fundamentals, model evaluation, and performance tradeoffs.
- Experience working with large image datasets and production ML pipelines.
- Ability to work collaboratively in a fast-moving, mission-driven engineering environment.
Preferred
- Experience with facial matching or biometric systems in regulated or high-stakes environments.
- Experience with image-based geolocation or scene/location inference.
- Familiarity with multimodal AI systems, including combining vision models with LLMs or natural-language search.
EDUCATION:
Bachelor’s degree in Computer Science, Engineering, Data Science, or a related technical field required.
Advanced degree is a plus but not required.
Aplyr's read
Babel Street leverages advanced data analytics for intelligence solutions, attracting professionals skilled in AI, cloud engineering, and data management.
What's promising
- Babel Street offers cutting-edge roles in AI and data analytics.
- The company is expanding its expertise in generative AI technologies.
- Strong focus on intelligence solutions provides competitive advantage.
What to watch
- Limited public information about company culture and work-life balance.
- High specialization may limit opportunities for generalists.
- Potentially high-pressure environment due to focus on intelligence solutions.
Why Babel Street
- Babel Street integrates public and proprietary data for comprehensive insights.
- The company emphasizes roles in emerging AI fields.
- Focus on intelligence solutions differentiates its product offerings.
Aplyr’s read is generated by AI from public sources.
About Babel Street
Babel Street is a technology company that specializes in data analytics and intelligence solutions, providing organizations with insights derived from public and proprietary data sources.