Data Engineer

  • Mitek
  • San Diego, CA, United States
  • Jul 26, 2017
Full time Data

Job Description

Mitek (NASDAQ: MITK) is an innovator in Artificial Intelligence and Machine Learning that simplify everyday tasks, is seeking qualified candidates for a Data Engineer role, based in San Diego. 

Mitek is a global leader in mobile capture and identity verification software solutions. Mitek’s ID document verification allows an enterprise to verify a user’s identity during a mobile transaction, enabling financial institutions, payments companies and other businesses operating in highly regulated markets to transact business safely while increasing revenue from the mobile channel. Mitek also reduces the friction in the mobile users’ experience with advanced data prefill. These innovative mobile solutions are embedded into the apps of more than 5,500 organizations and used by tens of millions of consumers for mobile check deposit, new account opening, insurance quoting, and more. 

We have a track record of breakthrough achievements that have helped to transform mobile banking and the identity authentication markets. As a result, we have grown globally with offices in the UK and the Netherlands and are listed on the NASDAQ.  

We’re looking for team members that live our core values of Delivering, Learning and Caring.



What You’ll Do (Role Description)

Mitek is looking for a Data Engineer to join our software engineering team. We are changing the way users perform their day-to-day banking activities through our patented mobile document capture technology. Our products include Mobile Verify, Mobile Fill, Mobile Docs, Mobile Deposit, and Commercial Mobile Deposit Capture. This is an great opportunity for someone looking to work at a fast growing company who specializes in mobile and cloud technologies.

The Data Engineer will be a key member of a newly formed cross functional data team responsible to drive best-in-class algorithms as a core asset of the Mitek product set.  To do this, the engineer will partner across the technical functions (R&D, Engineering, QA, and Product) to define data requirements for machine learning, testing, product measurement and evaluation activities. The engineer will design, build, and integrate solutions to harvest and label data from various data pipelines. 

Mitek leverages large data sets to train and evaluate our deep learning solutions which are cross platform and cutting edge. For this reason, we need someone who is technically sharp, a creative problem solver, productive working independently or collaboratively, and a successful analyst. 

You will:

  • Own data collection activities to harvest, ingest, and label data at scale, using a variety of techniques including scripting, writing queries and calling internal and external APIs.
  • Define and drive efficient processes for data collection, preparation, labelling, and collation.
  • Work with the product & technology teams to implement recommendations and algorithms into production.
  • Process unstructured data into a form suitable for analysis.
  • Support the business with routine and ad-hoc analysis (as needed).
  • Create reporting dashboards as needed to support effective communication and status.
  • Recommend and drive acquisition of tooling and infrastructure used for these processes.
  • Implementation of security and data protection policies and procedures.


Who You Are (Soft Skills/Attributes)

  • Self-starter and entrepreneurial mindset
  • Thrives in a fast-paced start-up team-focused culture and adapts to a changing environment
  • Data-driven, strategic mindset
  • Logical and creative problem-solving
  • Excellent interpersonal and relationship management skills
  • Planning, organization, and facilitation skills
  • Ability to manage and influence others (both within and outside your own direct work-group)
  • Ability to summarize complex issues simply and effectively


What You Need (Skills/Experience/Abilities)

  • 5+ Years of software development experience
  • Development/scripting language – Python
  • Experience using Python to build processes around data transformation, data structures, and metadata
  • API consumption using RESTful web services and JSON
  • Image processing and image manipulation libraries and associated technologies


Additional Preferred Experience

  • Amazon Web Services
  • Proficient understanding of code versioning tools such as Git
  • Exposure to Big Data platforms and technologies such as Amazon EMR and Hadoop
  • Exposure and experience with Amazon Mechanical Turk
  • Understanding of and experience in ETL (extract, transform, load) processes
  • Strong knowledge of and experience with statistics
  • Knowledge in data mining, machine learning, natural language processing, or information retrieval.
  • Exposure to SQL and NoSQL databases and document stores such as SQL Server, MongoDB, RavenDB
  • Experience processing large amounts of structured and unstructured data.  MapReduce experience is a plus.
  • Prior experience in secure practices of handling sensitive data and PII


Skills & Education

  • B.S. in Information Systems, Computer Science or a related field
  • Demonstrated ability to work with ambiguous requirements, adapt, and learn
  • Excellent written and verbal communication
  • Meticulous attention to detail and excellent problem solving/troubleshooting skills
  • Strong experience and comfort with use of Microsoft Office tools
  • Understanding of Agile development and test-driven-development techniques