NOTE: Harvey was used by me for years in production; however, it's a bit rough around the edges still. I eventually had to bite the bullet and switch to Docker Swarm but Harvey remains a viable options for small projects and deployments. I will no longer be actively maintaining the project since I no longer use it daily but welcome any work on the project to smooth over any lingering oddities. A good place to start is the Harvey Wishlist.
I've long been a fan of the simplicity of a docker-compose
file and its usage. Deploying with systems such as Rancher or a self-hosted GitLab seemed too daunting with uneccessary overhead for simple projects. Why can't I use the same setup in production as I do in development? Skip the environment variable injection and key stores, configuring production machines, and separate workflow configuration by having Harvey spin up my project by using Docker Compose in production, just like I do locally.
Harvey receives a webhook from GitHub, pulls in the changes, and deploys them. If you have Slack enabled, Harvey can send you the deployment summary along with running healthchecks to ensure the deployment was successful.
- GitHub webhook fires and is received by Harvey stating that a new commit hit an enabled repo on an allowed branch to be deployed
- Harvey pulls in your changes from GitHub and builds your Docker image locally
- Harvey spins up the new Docker container and tears down the old one once it's up and running (read: not zero downtime)
- (Optional) Harvey will then run a container healthcheck to ensure your new container is up and running
- (Optional) Harvey can send a message via Slack to notify users about the status of the deployment
Because Harvey interacts directly with the Docker daemon (using sockets), to build and orchestrate Docker images and containers, Harvey cannot run in a Docker container itself and must be run on your bare-metal OS.
- Install Docker
- Ensure you've added your ssh key to the ssh agent:
ssh-add
followed by your password (alternatively, seeUSE_HTTPS_AUTH
to use HTTPS URLs instead with something like Git Credential Manager) - Setup enviornment variables as needed in the
.env
file - Enable GitHub webhooks for all your repositories you want to use Harvey with (point them to
http://example.com:5000/deploy
, send the payload as JSON)- You could alternatively setup a GitHub Action or other CI flow to only trigger a webhook event once tests pass for instance (must include all the details as if it were generated by GitHub, see Workflow Webhook for an example on how to do this.)
# Install Harvey via GitHub
git clone https://github.com/Justintime50/harvey.git
cp .env-example .env # alternatively add your .env file to the $HARVEY_PATH
just install
You can alternatively download a release and follow similar steps to the above instead of cloning the repo.
# Run locally for development (runs via Flask)
just run
# Run in production (runs via uWSGI)
just prod
- Cloning
- Harvey will shallow clone your project to the most recent commit
- Naming
- Harvey expects the container name to match the GitHub repository name exactly, otherwise healthchecks will fail
- Harvey does not handle renamed or transferred repos for you. If you rename a repo, you may need to intervene manually to shut down the old container, remove it from Harvey, and startup the new one initially on your own
- Deployments
- Initial deployments are not gracefully handled. Because Harvey requires no configuration for a project, it assumes everything is already setup on the server. This means that on an initial deploy, you will need to set environment variables on your server, migrate databases, and whatever else may be required, at which point you may need to redeploy the project for the changes to take affect
- Future deploys should then work without additional intervention unless you have specific manual steps like updating env vars or future database migrations
- Because we use threads, you cannot kill an ongoing deployment because you cannot reliably kill a thread
- Initial deployments are not gracefully handled. Because Harvey requires no configuration for a project, it assumes everything is already setup on the server. This means that on an initial deploy, you will need to set environment variables on your server, migrate databases, and whatever else may be required, at which point you may need to redeploy the project for the changes to take affect
- Logs
- Harvey automatically rotates log files and keeps them for 2 weeks before purging them
- Each repo either needs a committed
.harvey.yaml
file in the root directory which will be used whenever a GitHub webhook fires, or adata
key passed into the webhook delivered to Harvey (via something like GitHub Actions). This can be accomplished by using something like workflow-webhook or another homegrown solution (requires the entire webhook payload from GitHub. Harvey will always fallback to the.harvey.yaml
file if there is nodata
key present) - You can specify one of
deploy
orpull
as thedeployment_type
(deploy
is the default) - Optional:
prod_compose: true
json can be passed to instruct Harvey to use a proddocker-compose
file in addition to the base compose file. This will run the equivelant of the following when deploying:docker-compose -f docker-compose.yml -f docker-compose-prod.yml
and is useful to allow both local and production compose setups in a single project.
deployment_type: deploy
prod_compose: true
healthcheck:
- container_name_1
- container_name_2
deploy:
needs: ["test", "lint"]
runs-on: ubuntu-latest
steps:
- name: Deploy to Harvey
if: github.ref == 'refs/heads/main'
uses: distributhor/workflow-webhook@v2
env:
webhook_type: "json-extended"
webhook_url: ${{ secrets.WEBHOOK_URL }}
webhook_secret: ${{ secrets.WEBHOOK_SECRET }}
data: '{ "deployment_type": "deploy", "prod_compose" : true, "healthcheck": ["container_name_1", "container_name_2"] }'
Harvey's entrypoint (eg: http://127.0.0.1:5000/deploy
) accepts a webhook from GitHub. If you'd like to simulate a GitHub webhook, pass JSON like the following example to the Harvey webhook endpoint:
{
"ref": "refs/heads/main",
"repository": {
"name": "justinpaulhammond",
"full_name": "Justintime50/justinpaulhammond",
"html_url": "https://github.com/Justintime50/justinpaulhammond",
"ssh_url": "[email protected]:Justintime50/justinpaulhammond.git",
"owner": {
"name": "Justintime50"
}
},
"commits": [
{
"id": "1",
"author": {
"name": "Justintime50"
}
}
],
"data": {
"deployment_type": "deploy"
}
}
Environment Variables:
ALLOWED_BRANCHES A comma separated list of branch names that are allowed to trigger deployments from a webhook event. Default: "main,master"
DEPLOY_ON_TAG A boolean specifying if a tag pushed will trigger a deploy. Default: True
HARVEY_PATH The path where Harvey will store projects, logs, and the SQLite databases. Default: ~/harvey
HOST The host Harvey will run on. Default: 127.0.0.1
LOG_LEVEL The logging level used for the entire application. Default: INFO
OPERATION_TIMEOUT The number of seconds any given operation (git command, deploy pipeline) can take before timing out. Default: 300
PAGINATION_LIMIT The number of records to return via API. Default: 20
PORT The port Harvey will run on. Default: 5000
SENTRY_URL The URL authorized to receive sentry alerts.
SLACK_BOT_TOKEN The Slackbot token to use to authenticate each request to Slack.
SLACK_CHANNEL The Slack channel to send messages to.
USE_HTTPS_AUTH Use HTTPS URLs instead of SSH URLs to authenticate with Git. Default: False
USE_SLACK Set to "true" to send slack messages.
WEBHOOK_SECRET The Webhook secret required by GitHub (if enabled, leave blank to ignore) to secure your webhooks. Default: None
/deployments
(GET) - Retrieve a list of deployments/deployments/{deployment_id}
(GET) - Retrieve the details of a single deployment/deploy
(POST) - Deploy a project with data from a GitHub webhook/projects
(GET) - Retrieve a list of projects/projects/{project_name}/lock
(PUT) - Locks the deployments of a project/projects/{project_name}/unlock
(PUT) - Unlocks the deployments of a project/projects/{project_name}/webhook
(GET) - Retrieves the current webhook of a project/locks
(GET) - Retrieve a list of locks/locks/{project_name}
(GET) - Retrieve the lock status of a project/threads
(GET) - Retrieves a list of threads (named after projects) running via Harvey. A thread indicates an ongoing deployment
The Harvey API can be secured via Basic Auth
by setting the WEBHOOK_SECRET
env var (this secret is used for authenticating a webhook came from GitHub in addition to securing the remaining endpoints).
At this time, there is no multi-user authentication options. A single secret (password) secures the entire service. Additionally, there are no permissions. Harvey should be deployed as a runner where admins are the only ones expected to interact with it. If you want multi-user authentication, you can use Harvey UI that allows multiple users to have their own credentials (no permissions). Each of those users then authenticates with Harvey under the hood with the same WEBHOOK_SECRET
.
# Retrieve a list of deployments
curl -X GET http://127.0.0.1:5000/deployments
# Retrieve a deployment from Harvey using the full repo name where slashes are replaced with dashes
curl -X GET http://127.0.0.1:5000/deployments/justintime50-justinpaulhammond
# Get a comprehensive list of development tools
just --list