Menlo Park, CA, 94025, USA
167 days ago
Linux System Administrator
Linux System Administrator Job ID 5894 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **Position overview:** Are you passionate about configuring and troubleshooting modern high-performance Linux clusters? Are you eager to contribute to groundbreaking scientific discoveries? SLAC National Accelerator Laboratory is searching for a dynamic Linux System Administrator to join our team. As a Linux System Administrator at SLAC, you'll be instrumental in deploying, maintaining, and monitoring the expansive scientific computing infrastructure that powers SLAC's data analysis and Machine Learning capabilities. Our laboratory is renowned for its pioneering research in photon science, accelerator physics, high-energy physics (HEP), and energy sciences. Joining the Scientific Computing System (SCS) Division within the Technology Innovation Directorate (TID), you'll have the opportunity to work with cutting-edge technologies and collaborate with a diverse user community. These include the Rubin Observatory, the Linac Coherent Light Source (LCLS) user facility, CryoEM user facilities, the LHC ATLAS detector at CERN, and accelerator controls for LCLS-II. If you thrive in a dynamic environment, enjoy teamwork, are excited about learning, and ready to make an impact in the world of scientific computing, we encourage you to apply for this opportunity. Take your career to new heights at SLAC, where innovation and discovery are at the heart of everything we do. **Given the nature of this position, SLAC will require onsite work.** **Your specific responsibilities will be to:** + Support the physical deployment of data center-related technology. + Perform system administration tasks across hundreds of Linux hosts. + Develop, install, and deploy Ansible (configuration management) to maintain operating systems, utilities, and applications software on computing systems. + Maintain a secure environment while allowing for open science to be effectively conducted by staying informed of and implementing security best practices. + Help manage our 24x7 operations by responding to system issues and emergencies in a timely manner. + Deploy and support core logging, monitoring, and alerting (notification) capabilities to track health, performance, and maintain system standards. + Assist in network infrastructure configuration, troubleshooting, and support. + Support Experimental Systems, ensuring seamless day-to-day operations and optimal functionality. This includes troubleshooting, maintenance, and enhancements. + Provide end-user support via our incident platforms and communication channels. + Maintain comprehensive documentation for all administrative procedures, ensuring accuracy and accessibility. + Serve as a primary liaison for all experimental groups, providing technical expertise, guidance, and leadership where needed. **To be successful in this position you will bring:** + Bachelor's degree in computer sciences, physics or related field and 8 years of relevant experience in information technology, systems administration, or high-performance computing; or a combination of education and experience. + Ability to work effectively in a team environment with excellent organizational and communication skills. + Experience with Linux system management, monitoring, and open-source software + Proficient in Bash and Python scripting. + Ability to deploy and troubleshoot multiple hosts in a clustered environment. + Familiarity with distributed compute and storage systems, high-performance computing systems, and networking. + Strong analytical and troubleshooting skills to identify root causes of complex issues. + Ability and willingness to learn and promote best practices. **In addition, preferred qualifications include:** + System administration experience. + Experience with Foreman with Ansible integration. + Experience connecting services to LDAP databases. + Experience configuring network switches. + Using and developing monitoring UIs and dashboards with Grafana. + Understanding security principles and best practices. **SLAC Employee Competencies:** + **Effective Decisions** : Uses job knowledge and solid judgment to make quality decisions in a timely manner. + **Self-Development** : Pursues a variety of venues and opportunities to continue learning and developing. + **Dependability** : Can be counted on to deliver results with a sense of personal responsibility for expected outcomes. + **Initiative** : Pursues work and interactions proactively with optimism, positive energy, and motivation to move things forward. + **Adaptability** : Flexes as needed when change occurs, maintains an open outlook while adjusting and accommodating changes. + **Communication** : Ensures effective information flow to various audiences and creates and delivers clear, appropriate written, spoken, presented messages. + **Relationships** : Builds relationships to foster trust, collaboration, and a positive climate to achieve common goals. **Physical Requirements and Working Conditions:** + You are expected to reside locally and work onsite 5 days a week + Be able to lift heavy objects (> 40 lbs) + Consistent with its obligations under the law, the University will provide reasonable accommodation to any employee with a disability who requires accommodation to perform the essential functions of the job. May work extended hours during peak business cycles. **Work Standards** : + Interpersonal Skills: Demonstrates the ability to work well with Stanford colleagues and clients and with external organizations. + Promote Culture of Safety: Demonstrates commitment to personal responsibility and value for environment, safety and security; communicates related concerns; uses and promotes safe behaviors based on training and lessons learned. Meets the applicable roles and responsibilities as described in the ESH Manual, Chapter 1—General Policy and Responsibilities: http://www-group.slac.stanford.edu/esh/eshmanual/pdfs/ESHch01.pdf + Subject to and expected to comply with all applicable University policies and procedures, including but not limited to the personnel policies and other policies found in the University's Administrative Guide, http://adminguide.stanford.edu ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Classification Title: System Administrator 3 Grade: K Job code: 4833 Duration: Regular Continuing _The expected pay range for this position is $129,000 to $157,000 per annum. SLAC National Accelerator Laboratory/Stanford University provides pay ranges representing its good faith estimate of what the university reasonably expects to pay for a position. The pay offered to a selected candidate will be determined based on factors such as (but not limited to) the scope and responsibilities of the position, the qualifications of the selected candidate, departmental budget availability, internal equity, geographic location and external market pay for comparable jobs._ SLAC National Accelerator Laboratory is an Affirmative Action / Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All staff at SLAC National Accelerator Laboratory must be able to demonstrate the legal right to work in the United States. SLAC is an E-Verify employer.
Confirm your E-mail: Send Email