
AI Engineer II - ML Deployment

Sorenson Communications
medical insurance, paid holidays, sick time, 401(k)
United States, Utah, Salt Lake City
4283 590 West
Sep 24, 2024
Description

Come be a part of our mission and make a meaningful and positive impact with the industry-leading provider of language services for the Deaf and hard-of-hearing!

Benefits



  • Paid Vacation Time and Paid Sick Time and Paid Holidays
  • 401k 6% match with immediate vesting
  • Nationwide Medical Insurance plans and coverage (Medical, Dental/Orthodontia, Vision)


    • TeleDoc
    • HSA company match
    • 3 Medical plan options including a Low Deductible PPO Medical Plan Offering


  • Employee Assistance Program
  • Engaged Employee Resource Groups
  • Outstanding Learning and Career Development Opportunities



Pay Range: Actual pay may vary up or down depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for incentive compensation.

Essential Duties and Responsibilities



    • Bring new ML technology developed in the AI Lab into production through close collaboration with AI Lab scientists and software engineers.


    • Apply good software engineering practices to create robust, production-ready, ML-integrated software systems.


    • Develop and maintain APIs and services to support both real-time and batch ML-integrated systems.


    • Containerize model pipelines using Docker so that they can be deployed on cloud platforms.


    • Continuously refine and optimize ML model pipelines for performance, scalability, and reliability in production environments.


    • Troubleshoot and resolve issues related to ML model serving.



    Supervisory Responsibility

    This position has no direct supervisory responsibilities but does serve as a coach and mentor for other positions in the department.

    Travel Requirements

    Less than 25%

    Education

    Minimum: 4-year / Bachelor's degree in Computer Science, Computer Engineering, Physics, Mathematics, or a related field.

    Preferred: Graduate Degree

    Minimum Years of Experience

    5 years of experience in software and AI roles. A Master's degree may be considered equivalent to 2-4 YOE. A PhD may be considered equivalent to 1-3 YOE.

    Knowledge, Skills, and Abilities



    • Proven experience in developing production-grade AI systems powered by neural networks, with a focus on performance and scalability.


    • Demonstrated proficiency in Python and at least one other language like C/C++ or Java.


    • Excellent understanding of the software development process, particularly in the context of transitioning ML models into production-ready systems.


    • Strong teamwork skills, a positive demeanor, and effective communication abilities.


    • Experience with ML model serving runtimes like Triton Inference Server, and deployment tools such as Docker or Kubernetes.


    • Experience with gRPC, WebSockets, or async APIs needed for streaming/online services.


    • Skill in profiling and optimizing ML model pipelines for efficient model serving with inference backends like ONNX Runtime.


    • Experience with model optimization techniques such as quantization for efficient inference in real-world applications.


    • A solid understanding of the key factors and best practices for achieving highly performant ML model serving.


    • Ability to write comprehensive unit tests for ML-integrated software systems.


    • Passion for continuous learning and staying updated with the latest AI tooling and research.


    • Experience with cloud computing services like AWS for model deployment is advantageous.


    • Experience with implementing optimized algorithms for low-latency ML systems in the speech and natural language processing domain, such as beam-search decoding, is highly desirable.



    Company Summary

    Our Mission...Harnessing the power of language, we connect diverse people and enrich the human experience.

    Our Vision...To provide global language services that expand opportunities, nurture belonging, and empower the world to connect beyond words.

    As one of the world's leading language services providers, Sorenson combines patented technology with human-centric solutions. We strive to increase diversity, equity, inclusion, and accessibility for underrepresented people through communication solutions for all: call captioning and video relay services, over-video and in-person sign language and spoken language interpreting, translation, real-time captioning, and post-production language services.

    Sorenson's impact vision and plan extend to supporting employment opportunities for diverse employees, customers, and communities. As a minority-owned company, we are committed to expanding opportunities for underserved communities while promoting an inclusive workplace for our own employees.

    Equal Employment Opportunity:
    Sorenson Communications is an Equal Opportunity, Affirmative Action Employer.

    Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
    The contractor will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with the contractor's legal duty to furnish information. 41 CFR 60-1.35(c)