About the company
Backed by industry-leading entrepreneurs and investors, Heretic Ventures builds companies at the intersection of culture, commerce, and creators, with a focus on using disruptive technology, like AI, to gain widespread adoption and lasting competitive advantage.
Job Summary
Responsibilities
šAI Model Training Pipeline Management Design, develop, and maintain scalable and efficient data pipelines for training AI models. šCollaborate with AI engineers and company management to understand requirements and implement solutions that result in improved model performance šImplement data versioning, tracking, and monitoring systems to ensure the quality and historical preservation of training data šBusiness Analytics Platform šLead the management and optimization of business analytics platforms, enabling stakeholders to derive actionable insights from diverse datasets including but not limited to behavioral data, transactional data, survey data šCollaborate with cross-functional teams to gather business requirements and translate them into data engineering solutions šEnsure data consistency, accuracy, and reliability within the business analytics platforms. šData Governance and Security šEnforce data governance policies and security measures to protect sensitive information šCollaborate with the security team to implement and maintain data privacy standards and compliance šDocumentation and Knowledge Sharing šCreate and maintain comprehensive documentation for data pipelines, ensuring knowledge transfer within the team
Qualifications
šBachelor's or Master's degree in Computer Science, Data Science, or a related field. š5 yearsā experience as a Data Engineer with a focus on data pipelines, cloud data solutions, analytics databases, visualization tools and business analytics platforms. Direct experience with AI model pipelines is a plus, but not required. šStrong proficiency in SQL, Python, and Linux. Comfortable with or ability to learn light Javascript for light tool configuration. šExperience working with large data sets, both structured and unstructured. Experience converting unstructured JSON and other data types into structured data in a scalable, automated manner. šUnderstanding of machine learning model types and generative AI concepts is a plus. šExperience with cloud based data and image management platforms such as BigQuery, Snowflake, Redshift, and S3. šIn-depth knowledge of database systems, data warehousing, and ETL and data orchestration processes.