Whenever I contribute to an existing project I start off by reading the code repository history. It’s a way to understand the project evolution and how the code changed over time. Besides this it gives you an idea about how well the project is organized, which are the major bugs and bottlenecks etc. However, I could rarely find projects which maintain a clean history. The reasons are many, but one is that the commit messages are not properly formulated. In this article I’m going to exemplify my view about good commit messages in GIT.

Mess

One way to think about GIT commit messages is like fingerprints you leave on the code changes, and they should be meaningful for someone who reads them after a year or more. The context it’s also important because it often happens to check how a particular class evolved over time and which were the commits which impacted that class. If it takes you more that a blink of an eye to understand it then this is a sign to reconsider the commit message. One good comparison is with traffic signs. If it takes more that a blink of an eye to understand the signs in a road-cross while you drive then the road designers should either simplify the signs or even change the road infrastructure. However, the risk of accidents increases drastically.

Example of bad commit messages

fixed broken stuff
fixed broken stuff - part2

Well, it’s good you fixed something, but what exactly did you fix? The commit message should contain more details about what has been fixed. More, when we are in a hurry or lazy we don’t provide a new commit message in case we forgot something. We normally rely on IDE and use the same changelist with the same commit message for something we omitted previously.

Update API with 3345. Fix malformed request

Here it’s better. At least it seems there are more details. But what 3345 is? Did the APIs update fix a bug implicitly? Or are there two separate changes wrapped in a single commit? One with the API update and the other one with a fix for a some malformed request. But which request?

Proper GIT commit messages

I’ll stick with the examples above.

[FIX] Close server connection properly #788 [see details]
 
- close socket and return to the connection poll
- checks for null pointer exceptions
- unit tests for socket handlers

The commit message from above is more verbose, and it offers a clear view about the changes. It’s not needed to be very technical to understand what the commit is all about. The first line states there is a fix for a bug related to server connectivity problem. At the end there is a reference to a GitHub issue (#788) and then it invites the reader to check for more details in case he wants to go deeper.

Those additional details are written on separate lines as a hyphen list. It’s actually the body of the commit message which is quite difficult to write with the -m option of git commit using the command line. You are better off writing the message into a text editor or your IDE.

It is commonly used to provide a GitHub or Jira ticket reference in case there is used such a platform for your project. Those references are automatically shortened and converted into links for easy navigation.

[IMPLEMENT] API support for image upload #3345

Now it’s clear that the commit is about API extension to support images uploads. Similarly, it points to a reference ( #3345) for more details about the feature. In this case there are no additional details, but this is perfectly fine. Notice that you could annotate the commit messages with something like [FIX], [IMPLEMENT], [FEATURE], [UPDATE] to make it clearer what is all about. There is no rule here, they work as tags and should be meaningful enough. You could pick whatever you want as long as they are widely adopted along the project and understood by everyone.

Use a linter. But be cautious

All I have written above are suggestions to make your code repository history more organized and clearer, but don’t get too obsessed with it. The point is to get better but not overcomplicate the development process. Try to discuss it before with your teammates or managers. In case they find it too cumbersome don’t insist. Try to improve your own commits messages and stick with that.

Some projects use tools to check the commit messages style before pushing them to the remote repository. Such tools could be relatively easy integrated with GIT or GitHub (see actions).

In general, these linters validate that your commits messages follow a predefined style, like:

  • the commit message title doesn’t exceed a max length
  • the title has no trailing whitespaces
  • second line is empty. Notice from my example above that there is a space between the title and commit message body
  • title doesn’t contain hard tab characters etc.