Big Data Systems Operations Engineer

Smith Arnold Partners

Global internet technology company focused on digital advertising technology. Ad tech powerhouse! Award-winning company experiencing rapid growth!
Employee Testimonials:
People are smart, friendly, and more than willing to help you learn and grow.
Best company I have ever worked for!
Incredible opportunities for career growth, big projects!
Very open environment where upper management keeps employees in the loop on the latest achievements and what is coming next.

Sr. Linux Administrator
Manhattan, NY – Temporarily Remote
$140,000 – $160,000 + Bonus

-Sr. Linux Administrator responsible for effective provisioning, installation/configuration, operation, and maintenance of systems hardware, software, and related infrastructure (a global environment of 8,000 Linux servers)
-The systems run various distributed applications such as Kubernetes, NGINX, and Artifactory, as well as more traditional services such as DHCP, DNS, and Kickstart. Responsibilities on these systems include systems administration, engineering and provisioning, operations and support, maintenance, and research and development to ensure the platform meets or exceeds business needs.
-Participate in technical research and development to enable continued innovation within the infrastructure. You will ensure that system hardware, operating systems, software systems, and related procedures adhere to organizational values, enabling staff and our partners to be successful.
-Manage servers and configure hardware, peripherals, services, settings, directories, storage, etc. in accordance with standards and project/operational requirements
-Identify areas of operation where automation can increase efficiency and reduce human error, and implement solutions that do so
-Evaluate new versions of software and technologies, and propose and implement the changes needed to leverage them for operations or project needs
-Identify potential security risks and propose practical mitigation measures
-Create, verify, and review patches to the software that runs the infrastructure, submitted as pull requests
-Ensure the integrity and availability of all hardware and key services by utilizing monitoring tools, log aggregation tools, and customer reports
-Ensure business data integrity by supporting our storage systems and performing any maintenance tasks necessary to prevent data loss (hardware repairs, fire drills, integrity checks)
-Provide support in response to requests from various constituencies. Investigate and troubleshoot any issues reported
-Maintain operations runbooks, configuration, and other procedures.
-Perform ongoing performance tuning, hardware upgrades, and resource optimization as required. This requires using various performance tuning tools to identify bottlenecks internal and external to the system.

-5+ years of experience supporting Debian-based Linux distributions such as Ubuntu
-5+ years of writing scalable tools using scripting languages such as Perl, Python, and shell
-5+ years of experience with configuration management tools such as Puppet, Ansible, and Terraform
-5+ years of managing storage systems running ZFS or CephFS
-3+ years of deploying and administering repository managers, especially with JFrog Artifactory
-3+ years of using monitoring tools such as Nagios and Sensu
-3+ years of deploying and administering systems using container technologies, especially Kubernetes and Docker, as well as Helm, Spinnaker, Prometheus, Calico, Flannel, Fluentd, and InfluxDB
-2+ years of building and managing Debian software packages from source, including creation of Makefiles.
-Familiarity with Git and other source control tools is required
-Familiarity with using AWS or Azure is preferred but not required
-Familiarity with configuring NGINX and Kerberos is preferred but not required
-Familiarity with a log management tool such as Splunk or Sumo Logic is preferred but not required