READ MORE
All roles

Backend Founding Engineer

Location:
San Mateo, CA / Telecommuting
Department:
Engineering
Apply now

This is a telecommuting position with remote benefit from various unanticipated locations throughout the U.S. Sawmills is the first smart telemetry management platform, designed to handle the explosion of telemetry data generated by modern distributed systems. By integrating deeply with OpenTelemetry and cloud-native infrastructure, Sawmills  AI enables teams to reduce observability cost, improve data quality, and safeguard realibility.

Responsibilities:

  • Lead design/development of Telemetry Data Management Platform
  • Craft scalable backend systems that integrate with open technologies (OpenTelemetry)
  • Develop/scale backend of Telemetry Data Management Platform [emphasizing compatibility with open technologies (OpenTelemetry)]
  • Design relational databases/async communication between microservices (Redis/message queue)
  • Write complex SQL queries, tuning db performance (indexes/partitions)
  • Develop End2End features (K8s, Redis, Postgres, Python, AWS tools)/new AI products (RAG, LLMs)/micro-services (GO, React over AWS EKS, RDS, Kafka, gRPC)
  • Use streaming algorithms (Count-Min-Sketch, HyperLogLog) for high cardinality solutions
  • Architect systems that process/analyze vast amounts of machine data
  • Participate in strategic planning/development of technical roadmaps
  • Work with founders to ensure features meet customers' needs
  • Collaborate with team to enhance integration with widely-used observability tools
  • Ensure robust/flexible data ingestion/analyses capabilities
  • Serve as advocate for/implement best practices in open-source software development
  • Contribute to open-source community
  • Monitor, troubleshoot/optimize system performance/reliability
  • Design/implement end to end CI/CD pipelines (GitHub Actions, Argo CD) to automate testing, security scans, zero downtime releases
  • Develop Terraform infrastructure as code modules to provision/manage multi account AWS environments
  • Work with instrument services (OpenTelemetry SDKs)
  • Define logs/metrics/traces standards to ensure comprehensive observability
  • Conduct performance profiling/load testing (k6, Locust) to validate system scalability to multi petabyte data volumes
  • Implement blue green and canary deployment strategies (AWS EKS) to minimize rollout risks
  • Perform prototype tiered storage/compression strategies (S3 + Glacier, Parquet, ZSTD) to optimize long term data retention costs
  • Lead threat modeling, IAM least privilege design, secret management efforts to harden platform security
  • Build internal developer tooling/SDKs (Go /TypeScript) that speed micro service bootstrapping/ enforce coding standards
  • Manage semantic versioning/release lifecycle of core platform components, ensuring backward compatibility
  • Coordinate incident response, root cause analysis, post mortem documentation to drive reliability improvements
  • Define data governance policies (schema evolution, GDPR/CCPA compliance), audit logging requirements
  • Evaluate/integrate third party data platforms (Snowflake, DuckDB, Vector) through technical proofs of concepts
  • Author technical blogs, conference talks, webinars to promote company thought leadership in observability
  • Oversee vendor assessments/contract negotiations for critical infrastructure services
  • Mentor/coach junior engineers via code reviews, pair programming, individualized growth plans
  • Stay abreast of new technologies/techniques in backend development/observability platforms

Job Requirements:

  • Bachelor's degree or foreign equivalent in Computer Science, Computer Engineering, or Software Engineering, plus 5 years of progressive experience.
  • Experience to include:
    • Designing relational databases/async communication between microservices (Redis/message queue)
    • Writing complex SQL queries, tuning db performance (indexes/partitions)
    • Developing End2End features (K8s, Redis, Postgres, Python, AWS tools)/new AI products (RAG, LLMs)/micro-services (GO, React over AWS EKS, RDS, Kafka, gRPC)
    • Using streaming algorithms (Count-Min-Sketch, HyperLogLog) for high cardinality solution
  • This is a telecommuting position with remote benefit from various unanticipated locations throughout the U.S.
Apply now
Our Benefits