HTCondor-CE

The HTCondor-CE software is a Compute Entrypoint (CE) based on HTCondor for sites that are part of a larger computing grid (e.g. European Grid Infrastructure, Open Science Grid). As such, HTCondor-CE serves as a "door" for incoming resource allocation requests (RARs) — it handles authorization and delegation of these requests to a grid site's local batch system. Supported batch systems include Grid Engine, HTCondor, LSF, PBS Pro/Torque, and Slurm.

For an introduction to HTCondor-CE, watch our recorded webinar from the EGI Community Webinar Programme:

What is a Compute Entrypoint?

A Compute Entrypoint (CE) is the door for remote organizations to submit requests to temporarily allocate local compute resources. These resource allocation requests are submitted as pilot jobs that create an environment for end-user jobs to match and ultimately run within the pilot job. CEs are made up of a thin layer of software that you install on a machine that already has the ability to submit and manage jobs in your local batch system.

What is HTCondor-CE?

HTCondor-CE is a special configuration of the HTCondor software designed as a Compute Entrypoint. It is configured to use the HTCondor Job Router daemon to delegate resource allocation requests by transforming and submitting them to the site’s batch system.

Benefits of running the HTCondor-CE:

  • Scalability: HTCondor-CE is capable of supporting ~16k concurrent RARs
  • Debugging tools: HTCondor-CE offers many tools to help troubleshoot issues with RARs
  • Routing as configuration: HTCondor-CE’s mechanism to transform and submit RARs is customized via configuration variables, which means that customizations will persist across upgrades and will not involve modification of software internals to route jobs