Skip to content

Next-Generation, Cloud-Native, Distributed Artefact Management System

Notifications You must be signed in to change notification settings

specialedge/hangar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Hangar

Next-Generation, Cloud-Native, Distributed Artefact Management System

Concept

For most, the artefact management systems in the market are fine. They let you upload and download artefacts and if you are feeling extra special, you might actually put authentication on it. Other than that, the system does what it needs to do to allow you to share artefacts in a company without too much hassle.

However, as you start to scale, artefact management becomes a blocker. The costs of storage, networking, operations (back-ups/restorations/uptime) start to add up but more fundamental than that, there's a cost to your business through a slowing down of delivery velocity as your builds get longer and your developers are sitting about waiting.

When you hit that point, there are no artefact management solutions on the market that can help you.

Why is this different?

Hangar is different than any other system on the market as it assumes that you are working in the cloud rather than in a datacentre. That seems like marketing nonsense but there are some serious differences when architecting for the cloud that no offering on the market takes into account (or have tried to shoehorn little bits to make it seem like they've have).

By being cloud-native from the very start, Hangar will assume everything you do will involve a cost so you are trying to run as little as possible to serve a requirement. Some examples...

  • Your build agents are immutable and scale up and down over the course of the day as you don't want to be running agents that aren't doing anything.
  • The storage of artefacts in a disk will be expensive over time, especially frustrating are those artefacts that have only been downloaded once but are still in storage.
  • With blue/green deployments having been around for almost a decade, you consider downtime to be a travesty and avoided when possible.
  • Moving artefacts around your estate will incur a cost (cross-AZ, NAT gateway etc) so you'd rather send as little data as possible.
  • You don't expect to have to operate a service that may exist natively in your cloud provider - one of the main reasons you moved there actually...

Is this too much to ask? We don't think so.

How does it work?

Here's the initial design for the v1.0 state. It may change over time and we plan to deliver smaller milestones along the way to get the value as soon as possible.

Hangar Diagram

  • Build Agents : Each agent will have a Hangar Daemon on it, managing artefacts local to the agent.
    • It will use torrenting software to retrieve any dependencies from other agents or from the central API as versions of artefacts can be very similar when working with continuous delivery/deployment.
    • The daemon will be responsible for uploading the artefact, with the option of having it work asynchronously so you can get on with the rest of your build.
    • Metrics about your artefact transport will be uploaded to the data science module, allowing a better picture of what artefacts are being used. On start-up of the agent, the daemon will contact the data science module and download only the most frequently downloaded artefacts.
  • Lifecycle Management : Storage of artefacts will be managed by this module, moving artefacts into cheaper storage as they become less frequently required then delete them (if requested) after a period of time.
  • Data Science : Data about how these artefacts are being used is held here, allowing us to actively pre-empt the requirement for artefacts in different locations, advise the lifecycle management module about what to delete and giving the visualisation module a basis for explaining the system. Being data-driven is a good thing.
  • Visualisation : To allow the engineers to see what is going on in the platform and what is available.
  • API : Where the work happens. Provides a torrent master for the build agents, deals with uploading and downloading of artefacts for engineers. Very similar code to the build agent daemons and will be stateless, allowing a rolling update without downtime.

The inital deliverables for this project are as follows...

v0.1 - Initial API supporting Maven / Gradle downloads

About

Next-Generation, Cloud-Native, Distributed Artefact Management System

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published