Position: High Availability Platform Development Co-op (Student Position)
Number of Position(s): 2
Duration: 6 Months+
Date: March 3rd, 2025
Location: Remote
EDUCATIONAL RECOMMENDATIONS
Currently a candidate for a Master’s, Bachelor's degree, or Diploma in Computer Science, Software Engineering, or a related technical field with an accredited school in the United States.
Basic knowledge of programming languages such as C and C++ Basic understanding of distributed systems, network infrastructure, and cloud technologies. Knowledge of Linux operating systems and scripting languages (e.g., Bash, Python). Familiarity with high-availability architectures. Strong problem-solving skills and attention to detail. Ability to work collaboratively in a fast-paced environment. Good written and verbal communication skills.Preferred Skills:
Familiarity with version control tools (e.g., Git, GitHub). Basic understanding of CI/CD pipelines and how automated testing integrates into the software development lifecycle.As a part of our team, you will:
Work with a talented team of engineers to support the development of robust, scalable, and highly available platforms. You will assist in building systems that ensure minimal downtime and maintain platform availability even in the event of component failures or performance issues. This role provides hands-on experience in developing fault-tolerant systems, applying modern technologies, and learning best practices in high availability (HA) and disaster recovery (DR) planning.
Assist in the design and development of high availability and fault-tolerant software systems. Collaborate with cross-functional teams to identify platform requirements for HA and disaster recovery. Participate in the development and testing of monitoring, failover, and backup mechanisms to ensure system resilience. Support the implementation of inter-process and inter-system communications Document processes, testing protocols, and support procedures for high-availability solutions. Assist in troubleshooting, debugging, and optimizing software components for increased platform resilience. Research and implement modern tools and techniques for improving system uptime and reliability. Participate in code reviews, testing, and quality assurance efforts to ensure platform stability.