Software Engineer - Site Reliability Engineer (SRE)

Lovelace Ai
Pittsburgh, PA

About Us:

Lovelace is the only provider of enterprise-scale context engines capable of analyzing trillions of real-time data points to create knowledge graphs that are usable by autonomous agents at the speed, scale, and accuracy required for mission-critical analysis.

Lovelace’s context engine platform, Elemental, uniquely integrates data ingestion, entity resolution, and graph building into a single pipeline that empower agentic deployments, delivering 1000X the investigative power for complex queries. With its proprietary ground-breaking YottaGraph, Lovelace provides enterprises with real-time, real-world context, enabling agents to understand the impact of global intelligence on enterprise data for unmatched insights with millisecond precision.

Founded in 2023 by Andrew Moore, former head of Google Cloud AI, dean of Carnegie Mellon’s School of Computer Science, and first AI advisor for U.S. CENTCOM, Lovelace currently works with some of the largest public and private enterprises in the world.

Job Summary:

  • Lovelace AI is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team. As an SRE at Lovelace AI, you will play a critical role in ensuring the availability, scalability, and performance of our cutting-edge AI-powered applications and infrastructure. You will bridge the gap between software development and operations, applying sound engineering principles and automation to maintain and improve our systems.

​​​​ Key Responsibilities:

  • Design, implement, and maintain robust monitoring, alerting, and observability solutions to proactively detect and resolve issues before they impact end-users.

  • Lead troubleshooting efforts for complex production issues, providing detailed root cause analysis (RCA) and implementing preventative measures.

  • Develop and maintain automation scripts, build systems (Bazel) and infrastructure as code (IaC) using tools like Terraform, Ansible, or CloudFormation to eliminate manual tasks and improve system reliability and efficiency.

  • Collaborate closely with software engineering teams to influence the design of new services and applications, ensuring they are scalable, reliable, and resilient from the outset.

  • Participate in on-call rotations to respond to platform emergencies, alerts, and escalations, ensuring high service uptime.

  • Analyze system performance and recommend optimizations for scalability, reliability, and efficiency.

  • Implement and enforce best practices in deployment, monitoring, and incident management to continuously improve overall system reliability and reduce downtime.

  • Develop and maintain internal tools that streamline complex operations, track bugs, manage CI/CD pipelines, and facilitate cross-team communication.

  • Conduct post-incident reviews, documenting software problems and solutions in a shared knowledge base to prevent similar issues in the future.

  • Assist with vulnerability management, system patching, and implementing security measures to protect the integrity and availability of services.

Qualifications:

  • 5+ years of experience in site reliability engineering, DevOps, systems administration, or related roles.

  • Proven track record of managing complex infrastructure, troubleshooting production issues, and optimizing system performance in high-scale environments.

  • Strong experience with Linux/Unix administration and proficiency in scripting languages (e.g., Python, Bash, Go).

  • Deep understanding of cloud platforms (AWS, GCP, Azure) and related services (e.g., EC2, S3, Lambda, Kubernetes).

  • Experience with containerization and orchestration technologies like Docker and Kubernetes.

  • Proficiency with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Dynatrace, ELK Stack).

  • Strong understanding of networking fundamentals (DNS, TCP/IP), load balancing, and CDNs.

  • Experience with CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI) and infrastructure automation.

  • Familiarity with distributed systems and microservices architecture.

  • Excellent problem-solving and troubleshooting skills.

  • Strong analytical skills with the ability to identify Service Level Indicators (SLIs) and align efforts to meet availability and latency objectives.

  • Ability to balance both development and support roles effectively.

  • Strong interpersonal skills and excellent communication skills, with the ability to collaborate effectively across various teams.

  • Experience in working on projects that involve business segments.

  • Must be a US Citizen.

Benefits:

LovelaceAI offers competitive compensation packages, comprehensive benefits. We provide a supportive and inclusive work environment where your skills and expertise can make a significant impact on the safety and security of our communities.

Lovelace’s founding team includes:

Andrew Moore , who has a track record of building impactful AI systems, designing them with human rights impact assessments as a top priority, leading the AI division of one of the world’s foremost cloud companies, and actively participating in machine learning and AI research over the past two decades.

Toby Smith , well known in the Pittsburgh Tech community for his engineering leadership and design skills, and who has led many of the most ambitious and complex system infrastructure projects in Google Pittsburgh and NetApp.

Here is a note from Andrew Moore to people who are reading these Job Postings:

“Hi folks, I’m so glad you are potentially interested in Lovelace AI. This area means a lot to me because while I am an AI optimist, I also think that we technologists owe it to a rightly skeptical world to show that modern intelligent systems can actually be useful. Usefulness comes in many guises: from life sciences to education and from transportation to entertainment and many others. For many of us, security and public safety is also very high on that list. That reasoning leads to this conclusion: I’m determined to make sure that the people building Lovelace AI gain a lot from the experience, including the chance to solve fascinating problems in computer science, AI, business development, customer success and product management. I also hope that we all learn from each other in a highly enriching work environment. But my main hope is that we have a shared sense of accomplishment as we see an increasing number of national security and public safety domains made safer through sensible and robust use of advanced computer science."

Posted 2026-05-24

Recommended Jobs

Childcare Infant Teacher

Kiddie Academy of Harrisburg, PA
Harrisburg, PA

Job Description Job Description Teachers are the pillars of every community, igniting a passion for learning that lasts a lifetime! Join us on this educational journey and GROW with us! JOB SUM…

View Details
Posted 2026-04-17

Transportation Litigation Associate Attorney

Wilson Elser - Attorneys
Philadelphia, PA

Job Description Job Description Wilson Elser is a leading defense litigation law firm with more than 1400 attorneys in 46 offices throughout the United States. Founded in 1978, we rank among the…

View Details
Posted 2026-04-23

Tax Accountant

OnlyHire.Me
Pittsburgh, PA

Tax Accountant Location: Pittsburgh, PA, 15219 Country: United States Salary: $75000-$85000 Start Date: Description: About the Job Status: Full Time, Salary Reports to: Partn…

View Details
Posted 2026-03-06

Installer

TAFCO
Clearfield, PA

Installer Job Summary: We are looking for a detail-oriented and skilled Installer to join our team. The Installer will be responsible for setting up, installing, and ensuring the smooth functio…

View Details
Posted 2025-10-16

Specialist, Information Assurance Compliance II (SIAC2)

Armada Ltd
Philadelphia, PA

Job Description Job Description Type: Full Time Location: Philadelphia, PA Overtime Exempt: Exempt Reports To: ARMADA HQ Travel Required: Yes Security Clearance Required: Act…

View Details
Posted 2026-03-20

Marketing Manager / E-Commerce Oversight

SOFITEL
Philadelphia, PA

Job Description Job Description Company Description About Sofitel Philadelphia at Rittenhouse Square Located in the prestigious Rittenhouse Square district , the AAA Four Diamond Sofit…

View Details
Posted 2026-04-17

Home Care HR Recruiter

Eminence Home Care
Bala Cynwyd, PA

Eminence Home Care is looking for a passionate and results-oriented Home Care HR Recruiter to join our dedicated team. In this role, you will be responsible for sourcing, recruiting, and onboarding…

View Details
Posted 2026-05-06

2nd Shift - Industrial Maintenance Technician / Easton PA

The Wenger Group
Easton, PA

Job Description Job Description Who are we: We're a leading Northeast family-owned food, agricultural products, and agricultural services organization headquartered in Pennsylvania. We provide…

View Details
Posted 2026-03-20

Business Development Manager - Metal Stamping

Jk Tool - New Kensington, Pa
New Kensington, PA

Description Weiss-Aug Group is an internationally recognized leader in precision metal stamping, injection molding, value-added assembly solutions and tooling. Our JK Tool Division located in N…

View Details
Posted 2026-05-24

Lead software developer

Collabrium Systems
Harrisburg, PA

Design, develop, implement, test, and deploy software applications. Analyze systems to ensure proper database architectures, coding standards, and that quality assurance policies and procedures are …

View Details
Posted 2026-05-24