Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineer, Global E-commerce - USDS image - Rise Careers
Job details

Site Reliability Engineer, Global E-commerce - USDS - job 1 of 2

ResponsibilitiesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (“USDS”) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.Why Join UsCreation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve. Join us.About the TeamThe Global E-commerce SRE team of US Tech Services works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. As an SRE, you will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.What You'll Do:• Own the service level of a critical, revenue generating E-commerce platform as well as all supporting infrastructure and services. This role will focus on service reliability, highly-scalable design and release management in a cloud-native environment.• Define service level indicators and data-driven objectives to uphold and improve uptime, latency, and system health of a core TikTok production platform.• Collaborate cross team with engineering and product to ensure that key requirements (such as capacity planning and launch reviews) are performed to enable transparent service delivery to customers.• Automation geared towards infrastructure-as-code, scalability and service resiliency• Implement SRE practices around incident management, post-mortems while being part of on-call rotations.QualificationsBasic Qualifications:• Good understanding of Unix/Linux operating systems internals and networking• Experience writing code in Java, Go, Python or a similar language• Expertise in designing, analyzing, and troubleshooting large-scale distributed systems (Redis, Elasticsearch, Kafka, Druid, Hadoop, Flink or comparable solutions), relational databases, caching solutions and web service frameworks• Experience with algorithms, data structures, complexity analysis and software design• Experience developing tools and APIs to reduce manual interaction with systems and applications using a variety of coding and scripting standards• Systematic problem-solving approach, coupled with effective communication skills and a sense of drivePreferred Qualifications:• Familiarity with running production grade web services at scale and understanding cloud native technologies and networking• Knowledge about a variety of strategies for ingesting, modeling, processing, and persisting data, ETL design, dimensional modeling, and cube designCandidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/ktJP6This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.Job InformationThe base salary range for this position in the selected city is $137750 - $237500 annually.​Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.​Our company benefits are designed to convey company culture and values, to create an efficient and inspiring work environment, and to support our employees to give their best in both work and life. We offer the following benefits to eligible employees:​We cover 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents and offer a Health Savings Account(HSA) with a company match. As well as Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans. In addition to Flexible Spending Account(FSA) Options like Health Care, Limited Purpose and Dependent Care.​Our time off and leave plans are: 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increased by tenure) and 10 paid sick days per year as well as 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.​We also provide generous benefits like mental and emotional health benefits through our EAP and Lyra. A 401K company match, gym and cellphone service reimbursements. The Company reserves the right to modify or change these benefits programs at any time, with or without notice.​For Los Angeles County (unincorporated) Candidates:​Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:​1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;​2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and​3. Exercising sound judgment.
TikTok Glassdoor Company Review
3.4 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
TikTok DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of TikTok
TikTok CEO photo
Shou Zi Chew
Approve of CEO

Average salary estimate

Estimate provided by employer
$167147 / ANNUAL (est.)
min
max
$146K
$188K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineer, Global E-commerce - USDS, TikTok

Join TikTok as a Site Reliability Engineer within the Global E-commerce team in Seattle, WA, where your role will be pivotal to the success of our widely-used platform. At TikTok, we thrive on creativity, and as an SRE, you will have the opportunity to own the reliability of a critical e-commerce platform that millions of users depend on. Your day-to-day will involve collaborating with engineering and product teams to ensure our distributed systems are not only functional but also efficient, highly scalable, and resilient. You’ll define service level indicators that enhance uptime and system health while emphasizing automation and infrastructure-as-code practices. With a hybrid work model offering flexibility, you’ll be encouraged to continue innovating and overcoming challenges together as part of a close-knit team. If you have a solid understanding of Unix/Linux systems, coding expertise in Java, Go, or Python, and a knack for problem-solving across large-scale distributed systems, this is the perfect opportunity for you to influence the tech industry. Come be a part of our mission to inspire creativity and bring joy, all while growing your skills in a supportive environment centered around teamwork and innovation. Let's create and grow together at TikTok!

Frequently Asked Questions (FAQs) for Site Reliability Engineer, Global E-commerce - USDS Role at TikTok
What are the responsibilities of a Site Reliability Engineer at TikTok?

As a Site Reliability Engineer at TikTok, your primary responsibilities will involve owning the reliability of our e-commerce platform, enhancing system observability, and automating processes essential to maintaining high uptime and performance. You'll collaborate with other teams to define key performance indicators and ensure seamless service delivery, while also participating in on-call rotations to address incidents and contribute to continuous improvement efforts.

Join Rise to see the full answer
What qualifications are required for the Site Reliability Engineer position at TikTok?

To qualify for the Site Reliability Engineer role at TikTok, candidates should possess a good understanding of Unix/Linux operating systems and networking, along with proficiency in coding languages such as Java, Go, or Python. Additionally, experience in troubleshooting and analyzing large-scale distributed systems, as well as a systematic approach to problem-solving, is essential. Familiarity with cloud-native technologies and production-grade web services is also preferred.

Join Rise to see the full answer
What is the work environment like for a Site Reliability Engineer at TikTok?

The work environment for a Site Reliability Engineer at TikTok is dynamic and collaborative. Following a hybrid work model, team members are expected to work in the office three days a week, enhancing face-to-face collaboration. TikTok fosters a culture of creativity, innovation, and mutual support, allowing engineers to thrive and contribute to complex projects while also growing professionally.

Join Rise to see the full answer
What type of projects will a Site Reliability Engineer work on at TikTok?

Site Reliability Engineers at TikTok will engage in a variety of high-impact projects related to the e-commerce platform's architecture, focusing on service reliability, systems automation, and scalability. This includes defining service level indicators, automating infrastructure management, and troubleshooting large-scale systems to ensure the platform remains robust and user-focused.

Join Rise to see the full answer
How does TikTok support professional development for Site Reliability Engineers?

TikTok is deeply committed to fostering professional development for Site Reliability Engineers through continuous training, mentorship opportunities, and cross-team collaboration. The culture encourages engineers to innovate and develop skills in cutting-edge technologies while working on real-world challenges that impact millions of users, all in an inclusive environment.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineer, Global E-commerce - USDS
How do you define service level indicators for a platform?

When defining service level indicators (SLIs), focus on metrics that directly impact user experience, such as uptime, response time, and error rates. Ensure that SLIs are measurable, relevant to business goals, and support the service level objectives (SLOs) established for the platform.

Join Rise to see the full answer
Can you explain your experience with distributed systems?

In discussing your experience with distributed systems, highlight specific projects you've worked on, the technologies you used, and the challenges you overcame. Emphasize your understanding of components like load balancers, microservices, and data consistency in such environments.

Join Rise to see the full answer
What steps do you take to troubleshoot a production issue?

To troubleshoot a production issue, I typically follow a systematic process: first, I identify the symptoms and gather relevant data (logs, metrics). Then, I analyze the information to find potential root causes, collaborating with peers if necessary, before implementing a fix and documenting the incident for future reference.

Join Rise to see the full answer
How do you handle on-call rotations and incident management?

While on-call, I prioritize proactive monitoring and use automation to reduce manual interventions. In managing incidents, I follow established protocols to communicate effectively with stakeholders, gather data quickly, and drive resolution efforts. Post-incident, I advocate for conducting thorough post-mortems to learn from mistakes.

Join Rise to see the full answer
What programming languages have you used, and how have you applied them as an SRE?

I've utilized programming languages such as Java and Python to develop automation scripts, create APIs, and improve our infrastructure management. By applying my coding skills, I’ve been able to significantly reduce manual tasks and enhance the efficiency of our deployment processes.

Join Rise to see the full answer
Can you discuss a time when you improved system reliability?

Certainly! In a past role, I identified a bottleneck affecting system performance. After analyzing the architecture, I implemented a caching solution that improved response times by 50%. This change not only enhanced user experience but also reduced load on backend systems, resulting in higher reliability.

Join Rise to see the full answer
What monitoring tools are you familiar with, and how have you used them?

I'm familiar with monitoring tools like Prometheus, Grafana, and Datadog. I have used these tools to set up alerts for critical metrics, visualize data for better insights, and help the team understand performance trends and anomalies, ensuring we maintain optimal system health.

Join Rise to see the full answer
Describe your experience implementing site reliability engineering practices.

In my previous roles, I implemented SRE practices by establishing clear SLOs, automating processes, and fostering a culture of collaborative incident response. I helped to create runbooks for repetitive issues and encouraged regular reviews of our practices to align with best-in-class standards.

Join Rise to see the full answer
What challenges have you faced while scaling a system, and how did you overcome them?

One challenge I encountered was during a major product launch when our system could not handle the unexpected traffic. To overcome this, I worked on enhancing our load-balancing strategy and increased our instances to scale dynamically during high demand, resulting in a successful launch.

Join Rise to see the full answer
How do you keep yourself updated with the latest technologies and trends in SRE?

I stay updated on the latest technologies and trends in SRE by engaging with online communities, attending conferences, and participating in webinars. I also invest time in self-study through books and courses, which helps me keep my skills sharp and my approach innovative.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Take Risks
Casual Dress Code
Startup Mindset
Emails over Meetings
Collaboration over Competition
Fast-Paced
Growth & Learning
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Mixe-Ability Accomodations
Work Visa Sponsorship
Commuter Benefits
Employee Resource Groups
Performance Bonus
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Take Risks
Casual Dress Code
Startup Mindset
Emails over Meetings
Collaboration over Competition
Fast-Paced
Growth & Learning
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Mixe-Ability Accomodations
Work Visa Sponsorship
Commuter Benefits
Employee Resource Groups
Performance Bonus
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Photo of the Rise User
Qualis Corporation Hybrid No location specified
Posted 13 days ago
Photo of the Rise User
Pipedrive Remote Czech Republic, Prague
Posted 5 days ago
Photo of the Rise User
Posted 2 days ago
Photo of the Rise User
Redwood Materials Hybrid Carson City, Nevada, United States
Posted 11 days ago
Photo of the Rise User
Domino's Hybrid 3355 Mike Collins Dr, Eagan, MN 55121, USA
Posted 2 days ago

Our mission is to inspire creativity and bring joy.

218 jobs
MATCH
Calculating your matching score...
BADGES
Badge Flexible CultureBadge Future MakerBadge Global CitizenBadge InnovatorBadge Rapid Growth
CULTURE VALUES
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Take Risks
Casual Dress Code
Startup Mindset
Emails over Meetings
Collaboration over Competition
Fast-Paced
Growth & Learning
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Mixe-Ability Accomodations
Work Visa Sponsorship
Commuter Benefits
Employee Resource Groups
Performance Bonus
Health Savings Account (HSA)
Flexible Spending Account (FSA)
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
December 4, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!