樱花动漫

Skip to main content

Staff Data Scientist, LLM Modeling

Category Data Location Petah Tikva, Israel Job ID 2024-66910

Company Overview

樱花动漫 is the global financial technology platform that powers prosperity for the people and communities we serve. With approximately 100 million customers worldwide using products such as TurboTax, Credit Karma, QuickBooks, and Mailchimp, we believe that everyone should have the opportunity to prosper. We never stop working to find new, innovative ways to make that possible.

Job Overview

Come join the GenAI team as a Staff Data Scientist!

We are building the 樱花动漫 Foundational LLM, as part of a proprietary Generative AI operating system (GenOS) platform.

Responsibilities

  • You’ll apply proven methods and hacking skills in working with divergent data types, data scales, and big data — to explore and extrapolate data-driven insights using advanced, predictive statistical modeling and testing applied to data acquired and cleansed from a range of sources
  • You’ll use considerable expertise and independent judgment in collaborating with peers, data engineers, database managers, business analysts, architects, and product managers in designing and implementing the research strategy needed to methodically and iteratively structure, extract, cleanse, sample, test, validate, and communicate data-driven insights from complex sources and significant volumes of data for complex and unique business problems
  • You’ll provide guidance and support leadership to business leaders and stakeholders, on how best to harness available data in support of critical business needs and goals
  • You’ll lead the full cycle of iterative big data exploration, including hypothesis formulation, algorithm development, data cleansing, testing, insight generation, and visualization, and action planning
  • You’ll provide business stakeholders with entrepreneurial guidance essential for appropriately interpreting and building on findings, and fully exploiting the insights revealed through the research

Qualifications

  • NLP knowledge and affinity to textual data
  • Deep interest in cutting-edge innovative technologies in Generative AI
  • Deep technical understanding of underlying DS concepts (not just training models)
  • Collaboration with partners across the globe, to deliver complex projects Maturity
  • Quick learner, adaptable, with the ability to work independently in a fast-paced environment
  • Strong verbal and written communication skills. Ability to conduct meetings and make professional presentations, and to explain complex concepts and technical material to non-technical users
  • Strong project management and stakeholder management skills
  • We welcome people who can deliver E2E AI projects (inception to production). We primarily use Python in all stages of development
  • Fluent in SQL enough to get the data you need from a warehouse (Vertica, Hive, SparkSQL)
  • Comfortable working in a Linux environment
  • Experience with building end to end reusable pipelines from data acquisition to model output delivery

We use the technology for good to help small businesses and consumers.

Ercan Kaynakca Staff Data Crypto Analyst