Springfield, Virginia, USA
4 days ago
Enterprise Management Engineer (Performance, Availability, and Capacity)
REQ#: RQ188302Public Trust: None Requisition Type: Regular Your Impact

Own your opportunity to serve as a critical component of our nation’s safety and security. Make an impact by using your expertise to protect our country from threats.

Job Description

We are GDIT.  The people supporting and securing some of the most complex government defense, and intelligence projects across the country.  We ensure today is safe and tomorrow is smarter. Our work has meaning and impact on the world around us, but also on us, and that's important.

GDIT is your place. You make it your own by embracing autonomy, seizing opportunity, and being trusted to deliver your best every day. We think. We act. We deliver. There is no challenge we can't turn into opportunity. And our work depends on TS/SCI level cleared Enterprise Management Engineer joining our team to support our Intelligence customer in Springfield, VA.

Description

Enterprise Management Engineer is needed to support Data Center Service in support of installation, administration, management, configuration, testing, and integration tasks related to Fault and Performance tools.  Work independently as part of a small team of system engineers responsible for the care and feeding of a diverse IT infrastructure

Principle Responsibilities

Demonstrated hands-on experience and knowledge of Linux platform (i.e. RHEL, CentOS) including administration, management, and troubleshooting for physical and virtual platformsDemonstrated hands-on experience and understanding of proactive monitoring concepts, including experience configuring and deploying Network and systems monitoring (i.e. SNMP, Nagios, Splunk, SolarWinds, etc.)Demonstrated experience performing trend analysis on overall system health, performance, and capacity management with regard to utilization and growthDemonstrated ability to develop and maintain capacity metricPerforms software upgrades, patch installs, firmware upgrades then test for functionality on a periodic basis.Demonstrated knowledge of the following infrastructure principles: Fault tolerance, High availability, Scalability and Capacity planning, Data center organization, Backup / RecoveryCreate shell and Perl scripts in various shells to automate daily and periodic tasksMaintain server configuration baselines and configuration compliance against baseline/benchmarksCollaborate with Application Teams to perform system maintenance and patch management tasksDocument work for leadership, update/create Standard Operating Procedures, and brief staff and customers various tasks.Interfaces with other engineering teams to adapt performance management tool capabilities to meet operational requirements.Assists with analysis using enterprise tool solutions and other tools to detect and respond to IT events, incidents, and outages.Performing systems hardening to DoD StandardsApply vendor patches and new designs to keep products up-to-date and meet security requirements.Work with other Service Providers to support areas of common interestWork with others engineers to establish and modify thresholds to better monitor systemsWorking with software and hardware vendors to resolve issues and share requirementsAssume other duties/projects as they arise and be responsive to the needs of the department

Must have current 8570 IAT II Level Certification (CNA-Security, GICSP, GSEC, Sec+ CE, SSCP) or higher within 90 days of hire.

Education/

Equivalent

Training Required

Bachelor’s Degree in Computer Science, Engineering or a related technical discipline, or the equivalent combination of education, technical training, or work/military experience

Professional certification in one or more relevant technologies.

Must have current 8570 IAT II Level Certification (CNA-Security, GICSP, GSEC, Sec+ CE, SSCP) or higher within 90 days of hir.

Experience

8+ years of related systems engineering experience.

5+ years of experience with monitoring tools

Skills and Abilities:

Abilities:

Must be able to support a large, complex server and network infrastructureMust possess a strong work ethic, be self-directed, and be a detail-oriented professionalMust be willing to learn and adapt to new, cutting edge technologiesMust be willing to document work and participate in customer change proceduresMust possess excellent time management skills and the drive to work unsupervisedMust be a team player; willing to both share knowledge and learn from others to ensure the team's success

Required Skills

Experience with Linux systems in the areas of system administration troubleshooting, integration, shell scripting, and development.Advanced scripting skills to include experience using Perl and PythonExperience with Infrastructure as a Service (IaaS)Ability to partner with other systems administrators, storage administrators, application developers, and network engineers to solve complex problemsStrong understanding of enterprise networks including load balancers, routers, switches, TCP/IP, DNS, Local Area Networking, AD, GPOFault event rules development, performance threshold managementExperience with tools in an enterprise environmentEvent analysisExperience with one or more of security hardening, backup management, capacity planning, change management, or patch management.Strong understanding of enterprise networks including load balancers, routers, switches, TCP/IP, DNS, Local Area Networking, AD, GPOAdvanced knowledge of systems engineering principles, methods, and techniques.Knowledge of the current industry hardware, software, and equipment.

Desired Skills

Experience with fault and performance management tools; including installation, configuration, and administrationExperience with CA eHealth and/or other network performance toolsExperience with any of the following: JIRA, Nagios, Python, Puppet, YUMRpo, Ansible Tower, and/or ILOMFamiliarity of VMware ESX 6.0/6.5/6.7  Experience with Windows System Administration

#RoverGSS

Confirm your E-mail: Send Email