Senior Software Engineer - Data Scientist (OCR and Document Training) - March 2024
New York
Mar 19, 2025
ABOUT CAPGEMINI
We focus on helping drive value for our customers in three key areas: customer experience, intelligent industry, and enterprise management.
10,000+ employees
Consulting, Information Technology
About Senior Software Engineer - Data Scientist (OCR and Document Training)

  Job description:

  We are seeking highly skilled Data Scientists specializing in Optical Character Recognition (OCR) and Document Training. Your primary mission will be to develop OCR solutions that extract information from documents and to use Google's Document AI to train models on the underlying unstructured data. With your extensive experience in data science and data engineering, you will play a pivotal role in transforming unstructured documents into actionable insights.

  Key Responsibilities:

  a) Develop OCR Solutions: One of your primary responsibilities will be to design and develop OCR solutions capable of accurately extracting textual and structural data from various types of documents.

  b) Leverage Document AI: You will harness the power of Google's Document AI to train models on the extracted unstructured data. This process will involve structuring, categorizing, and making sense of large volumes of textual information.

  c) Experience and Expertise: With a minimum of 5 years of experience in the field of Data Science, you will bring a deep understanding of machine learning, natural language processing, and computer vision. Your expertise will be instrumental in solving complex OCR and document training challenges.

  d) Data Engineering Experience: In addition to your data science skills, you should have experience in data engineering, particularly in handling key-value pairs and structuring unstructured data for effective analysis. This skill set will be essential for preprocessing and structuring data before applying machine learning models.

  Technical Requirements:

  To excel in these roles, you should have expertise in:

  • Optical Character Recognition (OCR) technologies

  • Machine learning and deep learning

  • Natural language processing (NLP)

  • Computer vision

  • Google's Document AI or similar document analysis tools

  • Data preprocessing and data engineering

  • Programming languages like Python, SQL, and relevant libraries/frameworks

  Life at Capgemini

  Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:

  • Flexible work

  • Healthcare including dental, vision, mental health, and well-being programs

  • Financial well-being programs such as 401(k) and Employee Share Ownership Plan

  • Paid time off and paid holidays

  • Paid parental leave

  • Family building benefits like adoption assistance, surrogacy, and cryopreservation

  • Social well-being benefits like subsidized back-up child/elder care and tutoring

  • Mentoring, coaching and learning programs

  • Employee Resource Groups

  • Disaster Relief

  About Capgemini

  Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided every day by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 360,000 team members in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast-evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group reported in 2022 global revenues of €22 billion.

  Get The Future You Want | www.capgemini.com

  Disclaimer

  Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

  This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

  Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.

  For more information on your rights as an Applicant, see: http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law

  Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process.

  Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.

  Capgemini discloses salary range information in compliance with state and local pay transparency obligations. The disclosed range represents the lowest to highest salary we, in good faith, believe we would pay for this role at the time of this posting, although we may ultimately pay more or less than the disclosed range, and the range may be modified in the future. The disclosed range takes into account the wide range of factors that are considered in making compensation decisions including, but not limited to, geographic location, relevant education, qualifications, certifications, experience, skills, seniority, performance, sales or revenue-based metrics, and business or organizational needs. At Capgemini, it is not typical for an individual to be hired at or near the top of the range for their role.

  The base salary range for the tagged location is $145,000 to $187,000. This role may be eligible for other compensation including variable compensation, bonus, or commission. Full time regular employees are eligible for paid time off, medical/dental/vision insurance, 401(k), and any other benefits to eligible employees.

  Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, or any other form of compensation that are allocable to a particular employee remains in the Company's sole discretion unless and until paid and may be modified at the Company's sole discretion, consistent with the law.
