Petabyte Technology Inc: Senior Data Engineer


USA Only

About the company:

Petabyte Technology is a well funded, fast growing startup, combining best tech talent with pet industry veterans. We build data driven solutions to bring veterinarians and pet parents together on a single platform. For veterinarians, Petabyte brings best-in-class practice management software, Rhapsody, with industry leading veterinary vocabulary, mobile workflow, and scalability for corporate groups of all sizes, as well as best in class analytics solutions to unlock full potential of the practice. For pet parents, Petabyte brings a best-in-class mobile app, Boop, a membership-based experience to deeply engage with their veterinarians. Through Boop, pet parents have seamless access to book appointments, manage their pets medical records, sign-up for wellness/insurance programs, and ultimately build a rich, long-term relationship with their veterinarians. Petabyte is industry first by combining pet parents and veterinarians onto a single platform, serving the entire per industry with best in class software. Join our growing team and help us transform the pet industry!

About the role:

You will be a part of an amazing, fast growing team, working on flagship products of the company, Rhapsody and Analytics. As lead data engineer will own one of the key subsystems, processing and categorizing all the incoming data. You will be tasked with analyzing complex unstructured data sets, making sense of that data, creating efficient ingesting pipelines to convert the data into a format our system understands. Solving those kinds of problems will require creative thinking, ability to work independently and pation to solve complex data problems.



  • Computer Science degree or equivalent
  • Solid understanding of algorithms and data structures
  • 5+ years of professional software development experience
  • 5+ years of programming experience with Java/JavaScript
  • Proven experience working with modern SQL/NoSQL databases
  • Proven experience working with large and complex data sets
  • Proven experience building and supporting complex data processing pipelines
  • Proven experience discovering and optimizing data processing solutions
  • Proven experience in applying machine learning on large unstructured data sets
  • Ability to think independently, solving data puzzles and making sense of unstructured data
  • Mentoring junior developers

Preferred Qualifications:

  • Experience in training machine learning models for NLP
  • Experience in training machine learning models for schema discovery and entity categorization
  • Working with GCP
  • Experience working with Kubernetes
  • Experience working with Node.js
  • Experience developing software for the medical industry
  • Working with distributed teams in different timezones
Apply for the Job

Recent Job Postings