HPC Cluster Engineer

Company:

Caixa Mágica Software



Job function:

Engineering

Job Details

What will you do:

Main tasks:
- Administer a Linux-based GPU HPC cluster for Artificial Intelligence (AI)
- Support the VRED rendering cluster and the HPC cluster for Computer Aided Engineering (CAE)
- Maintain in-house shell scripts
- Investigate failed computations, determine problems, resolve incidents, provide system support, and coordinate with vendors
- Support and educate users with no Linux experience
- Install and configure hardware, OS, and software, and tune all R&D Linux workstations
- Manage patching of Linux systems, including offline systems
- Manage network aspects (DNS, DHCP, internet access, …) with the Network Team
- Perform daily monitoring, manage the backup environment (Ceph), and ensure cluster high availability
- Support artificial intelligence engineers in setting up development environments on the GPU HPC cluster
- Create long-term, centralized environment management
- Support the setup of a driving simulator based on a real-time OS
- Support the setup of an integrated engineering development environment: Linux laptops, office workstations, in-car computer
- Set up the patching environment, including workstations without an internet connection
- Ensure the Linux environment matches company security standards
- Collaborate with other technical teams and integrate Linux workstations into the AD domain
- Deploy and maintain the AWS AI cluster
- Support the AWS VRED and CAE clusters

As backup for other team members:
- Administer the HPC cluster for Computer Aided Engineering (CAE)
- Provide L1/L2 support on the HPC cluster for the customer
- Maintain applications running on the cluster
- Manage patching and upgrades of the managed environment
- Monitor regular backups and ensure cluster high availability

What are we looking for?
- Linux OS and server knowledge
- Cluster management
- Infrastructure administration
- Virtualization knowledge
- Understanding and operation of storage solutions
- AI understanding is an asset

Keywords that are important in HPC systems:
- Workload manager: Slurm, PBS
- Parallel file system: Lustre, Ceph, BeeGFS
- HPC management tools: Bright or Nvidia, xCAT
- AI keywords: GPU, Docker, Python
- OS: RHEL, Ubuntu, Rocky Linux

What can you expect from us?
- A permanent job contract for a long-term project
- Tech equipment + SIM card + personal smartphone
- Health and life insurance
- Social events and team-building activities
- A commitment to letting you grow with us and be rewarded accordingly
- A dynamic, young team that will always be there to support you
- Training in the latest technologies
- Coffee, fruit, snacks, and a warm welcome when you stop by the office

Job type: Full-time

Benefits:
- Meal card/voucher
- Life insurance
- Health insurance
- Company mobile phone

Working hours: Flexible schedule

Education: Higher education (preferred)




