Location: Fully remote however you MUST either live in CT or ET or be willing to be available as needed during CT/ET office hours 4 days a week
Skills: Approx 5+ yrs SRE & DevOps Engineering
Salary: Up to 150k base + great benefits + work 9-5pm Monday-Thursday only!
We are an Enterprise SaaS company which is truly passionate about its social mission. You will developing software which helps some of society's most vulnerable. If you are a seeking a mission driven company with incredible people and amazing work life balance.....read on!
Top Reasons to Work with Us
- Mission driven company with employees who really care about that mission, have a high IQ but also high EQ! Great team of people who are very collaborative, open and who care deeply about each other and the work they are doing
- Very flexible culture. The whole company works remote
- Strong work/life balance - WORK ONLY 4 DAYS A WEEK!! (9-5 M-Th). Unlimited PTO but all employees are REQUIRED to take a minimum of 3 weeks!
- Work on modern stacks and work on a greenfield engineering initiative
What You Will Be Doing
This is an incredible opportunity to join a growing SaaS company and a group of highly motivated cross-functional Scrum Team members (Dev, QA, Product, Design). As we look to evolve our AWS cloud operations for our product as part of our broader redesign initiative, we would look to our site reliability engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction. You will design and develop our IaC using cloud and containerization technologies. AWS is our cloud platform of choice.
Responsibilities:
- Work closely with the other engineers on the team to build and evolve our AWS cloud infrastructure for our product using infrastructure-as-code (IaC)
- Use Pulumi to write infrastructure-as-code using Typescript
- Use GitLab CI for CI/CD automation and Docker for containerization
- Design, implement and evolve our product's cloud infrastructure for optimal system availability, reliability, performance and scalability based on business needs and cost considerations, working closely with Product and Engineering leads
- Manage, configure and troubleshoot OS, storage, networking (VPCs, proxies and CDNs), and administer high-availability PostgreSQL using AWS RDS
- Build monitoring that alerts to spot symptoms and allow preemptive fixes before incidents occur
- Collaborate with other engineers to provide operational support for our product systems
- Document architecture, processes, and create/maintain runbooks
- Leads by example in terms of adherence to our Engineering Definition of Done in PR reviews & merges
- Manage cloud security via AWS tooling across networking, configuration, and identity
- Write and maintain automation to reduce manual tasks
- Pick up and understand new tools, concepts, requirements efficiently and quickly
- Be an accountable and committed member of a highly functional Scrum Team
- Drive Sprint goals to completion by monitoring Team JIRA boards and working closely (swarm) with team members to get user stories completed in priority order
What You Need for this Position
What's 100% essential:
- 5+ yrs Site Reliability Engineering & DevOps
- Networking (must understand network interfaces & how to connect networks - bonus points if that is also in the cloud)
- Linux Administration & Apache (or similar)
- AWS - good experience with the basics like EC2 and IAM
- Database Administration - at least some experience with permissioning and scalability, preferably with Postgres
- Containerization (Kubernetes or similar)
- Scripting skills
- IaC experience
What's nice to have but NOT required:
- CICD pipelines with Gitlab
- Saltstack or similar for CM
- Healthcare applications
- PHP-FPM/Laminas framework, or JavaScript or TypeScript
What's In It for You
Competitive base, bonus plus strong benefits package including low cost medical/dental, 30 days PTO/holidays, flexible hours and working from home, 4 day work week, FSA, 401k match.
So, if you are a Senior Cloud Site Reliability Engineer with experience, please apply today!
Applicants must be authorized to work in the U.S.