Modern CPUs Need Smarter Scheduling
Current and future CPUs offer more than just cores and sockets. HPC schedulers require information about L3, L2 caches, cache groups (like AMD's CCX), and power/efficiency cores to allow scenarios...
The Model Context Protocol (MCP) defines a powerful and simple protocol for AI applications to interact with external tools. Its key benefit is modularity: any tool implementing an MCP server can be...
The most valuable digital assets in human history are undoubtedly cat pictures :). While they once dominated the web, today they have become a focal point for AI-generated imagery. Therefore, why not...
In today's world of ever-evolving technology, the need for efficient and intelligent job workflows is more important than ever. With the advent of large language models (LLMs) like GPT, we can now...
wfl is a versatile and user-friendly Go library designed to simplify the management and execution of workflows. In this blog post, we will explore how wfl can be employed for various machine learning...
This article was originally published at UberCloud's Blog. UberCloud provides all necessary automation for integrating cloud-based self-service HPC application portals in enterprise environments. Due...
The Singularity containerization software is re-written in Go. That’s great news and immediately caught my interest :-). For a short intro to Singularity there are many sources including...
There are many workload managers running in the data centers of the world. Some of them take care of running weather forecasts, end-of-day reports, reservoir simulations, or simulating...
Creating an API which follows the DRMAA2 standard is not an easy task. Unlike the DRMAA(1) standard, it defines more than twice as many methods (around 100), all of which need to follow a compatible behavior.
Univa Grid Engine 8.3 has been out for a while, and even the first patch, 8.3.1, is available for download. Lots of great new features like the long-awaited real preemption of jobs. Also a new Web Service...
Today's talk at HEPiX at Oxford University is about the DRMAA2 standard. You can follow their live stream here.
The cluster monitoring and job submission API based on the DRMAA2 standard makes it easy to build cluster monitoring and job workflow submission applications. Univa Grid Engine supports this open...
See also the blog entry at www.wheregridenginelives.com.
A free webinar about the upcoming DRMAA2 C API implementation in Univa Grid Engine can be downloaded here.
"Evolution is a process of creating patterns of increasing order" (Ray Kurzweil, The Singularity Is Near) It is obvious that open standards are important in the software industry. They protect...
Looks like the Go (#golang) programming language is becoming popular for cluster management. Kamil offers a project called Gorge on GitHub. Similar to my gestatus sub-package of the Go DRMAA...
Today I pushed the source code of the Go DRMAA language binding to a GitHub repository: https://github.com/dgruber/drmaa. It now contains a sub-package called gestatus (Grid Engine status) which...
The first language binding is now officially approved as an OGF standard, GFD-R-P.198. You can get the standard here.
One thing not covered yet is the failure handling in Go DRMAA. The package documentation shows that an error is a pointer to a DRMAA Error type, which is a struct consisting of an error id (Id) and...
To continue the Go DRMAA API description series, below I present a small program that submits jobs with Univa Grid Engine-specific submission parameters and reports all collected job usage values.
In my last article I showed two basic examples of how to use the Go DRMAA binding for simple job submission and job status checks. This time I want to demonstrate how easy it is to submit thousands of...
I just uploaded a slightly enhanced version of the Google Go DRMAA language binding. It now offers the missing JobInfo access methods and the JobTemplate SetArg(), which accepts a simple string....
To me, Google's Go looks like the most interesting programming language published in recent years (of course there are others, like “Julia”, but they are more domain specific (technical...
The public comment period is over, so last night we held the final meeting, where we went over the last issues of the upcoming DRMAAv2 C language binding. So the official final C spec...
Preliminary DRMAA2 API documentation (pre-alpha version) is linked here.
Over in Japan, researchers working on the K supercomputer recently published a paper about a middleware which they call VGE - Virtual Grid Engine. It allows them to run bioinformatics...
This article describes how the Univa Grid Engine accounting file can be processed with Glow. The Univa Grid Engine cluster scheduler spits out a text file containing information about the resource...
David Chin created a Python wrapper for Univa Grid Engine's DRMAA2 compatible C library for cluster monitoring, job and workflow management. It is available on his GitHub account. Great!
The Distributed Resource Management Application API (DRMAA) is a programming library for job submission and job management. It abstracts away vendor-specific job submission methods and provides a...
DRMAA or DRMAA version 1 is a highly adopted standard for accessing DRMs (distributed resource management systems). The DRMAA IDL API 1.0 specification can be found here and here. The following systems...
The DRMAA 2 standard (final draft 8) is open for public review until August 2nd, 2011, 23:59 CET. The PDF document can be downloaded here. Please download the final DRMAA2 publication here.
Doing evil things! The JGDI API is an unsupported (but available) Java interface for accessing and controlling Grid Engine. It is unsupported because there are no guarantees that the interface will...