Explore and Test Docker Containers
Linux containers are a way to build a self-contained environment that includes software, libraries, and other tools. This guide shows how to explore and test a Docker container on your own computer.
A. Overview
Note that all the steps below should be run on your own computer, not in CHTC.
This guide has two sections:
If you’ve never used Docker before, and/or are getting ready to build your own container image, we recommend starting with the first part of the guide.
If you’ve explored Docker already or built your own image and you want to test if it will work successfully in CHTC’s HTC system, you can follow the directions in the second section.
A. Set Up Docker on Your Computer
If you haven’t already, create a DockerHub account and install Docker on your computer. You’ll want to look for the Docker Community Edition for your operating system. It sometimes takes some time for Docker to start, especially the first time. Once Docker starts, it won’t open a window; you’ll just see a little whale and container icon in one of your computers toolbars. In order to actually use Docker, you’ll need to open a command line program (like Terminal, or Command Prompt) and run commands there.
B. Explore Docker Containers
1. Get a Docker Container Image
We need to have a local copy of the Docker container image in order to test it. You can see what container images you already have on your computer by running:
$ docker image ls
If you just installed Docker on your computer and are using it for the first time, this list is probably empty. If you want to use a pre-made container from Docker Hub, you will need to “pull” it down to your computer. If you created a container on your computer, it should already be in the list of container images.
If using a container from Docker Hub, find the container and its name, which
will be of the format: username/imagename:tag
. Then pull a copy of the container
image to your computer by running the following from either a Terminal
(Mac/Linux) or Command Prompt (Windows):
$ docker pull username/image:tag
If you run docker image ls
again, you should see the container you downloaded
listed.
2. Explore the Container Interactively
To actually explore a container, run this command:
$ docker run -it --rm=true username/image:tag /bin/bash
This will start a running copy of the container and start a command line shell inside. You should see your command line prompt change to something like:
root@2191c1169757:/#
What Do All the Options Mean?
-it
: interactive flag--rm=true
: after we exit, this will clean up the runnining container so Docker uses less disk space.username/image:tag
: which container to start/bin/bash
: tells Docker that when the container starts, we want a command line (bash
) inside to run commands
If you explore the container using cd
and ls
, you’ll see that this is a whole,
self-contained file system, separate from your computer. Try running commands with their
--help
or --version
options to see what’s installed. If you’re planning to create
your own container, try following a few of the installation instructions for the software
you want to use and see what happens.
3. Exit the Container
Once you’re done exploring, type exit
to leave the container.
root@2191c1169757:/# exit
Note that any changes or
commands you ran in the container won’t be saved! Once you exit the
running container is shut down and removed (although the container image will still be
on your computer, which you can see if you type docker image ls
again).
C. Simulate a CHTC Docker Job
The directions above were about simply exploring a container. If you want to simulate what happens in a CHTC job more specifically, we’ll want to do a few things:
- create a test working directory, with needed files
- have a list of commands to run or a script you want to use as the executable.
- use some extra options when running the container.
1. Create Working Directory
For testing, we need a folder on your computer to stand in for the working directory that HTCondor creates for running your job. Create a folder for this purpose on your Desktop. The folder’s name shouldn’t include any spaces. Inside this folder, put all of the files that are normally inside the working directory for a single job – data, scripts, etc. If you’re using your own executable script, this should be in the folder.
Open a Windows Command Prompt or Mac/Linux Terminal to access that folder, replacing “folder” with the name of the folder you created.
- Mac/Linux:
$ cd ~/Desktop/folder
- Windows:
$ cd %HOMEPATH%\Desktop\folder
2. Plan What to Run
Once the container starts, you have a few options for testing your job:
- Run Commands Directly
- When you start the container, you’ll be able to run each command you want to use, step-by-step. If you have multiple commands, these will eventually need to be put into a shell script as your executable.
- Example: Running multiple steps of a bioinformatics pipeline
- Run an Executable
- If you’ve already written a script with all your commands or code, you can test this in the container.
- Examples: Running a shell script with multiple steps, running a machine learning Python script
- Run a Single Command
- If you only want to run one command, using a program installed in the Docker container, you can run this in the container.
- Example: Running GROMACS from a container
3. Start the Docker Container
We’ll use a similar docker run
command to start the Docker container,
with a few extra options to better emulate how containers are run in
the HTC system with HTCondor.
This command can be run verbatim except for the
username
, imagename
and tag
; these should be whatever you used to
pull or tag the container image.
- Mac/Linux:
$ docker run --user $(id -u):$(id -g) --rm=true -it \ -v $(pwd):/scratch -w /scratch \ username/imagename:tag /bin/bash
- Windows:
$ docker run --rm=true -it -v ${pwd}:/scratch -w /scratch username/imagename:tag /bin/bash
For Windows users, a window may pop up, asking for permission to share
your main drive with Docker. This is necessary for the files to be
placed inside the container. As in the previous section, the docker run
command
will start a running copy of the container and start a command line shell
inside.
What Do All the Options Mean? Part 2
The options that we have added for this example are used in CHTC to make jobs run successfully and securely.
--user $(id -u):$(id -g)
: runs the container with more restrictive permissions-v $(pwd):/scratch
: Put the current working directory (pwd
) into the container but call it/scratch
. In CHTC, this working directory will be the job’s usual working directory.-w /scratch
: when the container starts, make/scratch
the working directory
4. Test the job
Your command line prompt should have changed to look like this:
I have no name!@5a93cb:/scratch$
We can now see if the job would complete successfully!
If you have a single command or list of commands to run, start running them one by one. If you have an executable script, you can run it like so:
I have no name!@5a93cb:/scratch$ ./exec.sh
If your “executable” is software already in the container, run the appropriate command to use it.
Permission Errors
The following commands may not be necessary, but if you see messages about “Permission denied” or a bash error about bad formatting, you may want to try one (or both) of the following (replacing
exec.sh
with the name of your own executable.)You may need to add executable permissions to the script for it to run correctly:
I have no name!@5a93cb:/scratch$ chmod +x exec.sh
Windows users who are using a bash script may also need to run the following two commands:
I have no name!@5a93cb:/scratch$ cat exec.sh | tr -d \\r > temp.sh I have no name!@5a93cb:/scratch$ mv temp.sh exec.sh
When your test is done, type exit
to leave the container:
If the program didn’t work, try searching for the cause of the error messages, or email CHTC’s Research Computing Facilitators.
If your local test did run successfully, you are now ready to set up your Docker job to run on CHTC.