Commit Indexer Plugin Module
Introduction
Bitbucket Server has a lightweight indexing process that occurs every time new commits are pushed to its repositories. Commit indexer plugin modules let you extend the indexing pipeline and store additional indexed data. This allows your plugin to, for example, watch commit messages or other commit metadata for particular content or efficiently store additional information about commits for later retrieval.
Example commit indexers might include:
- An indexer that looks for particular substring or pattern in a commit message, for example an issue or build tracking number (this is how Bitbucket Server's bundled JIRA key indexing is implemented).
- An indexer that records commit statistics to be used later for generating reports or charts.
Commit indexers should be fast. The indexing process is applied to every commit pushed to Bitbucket Server, so long-running or computationally expensive indexing operations can have a serious impact on server performance. Where possible, indexers should avoid making database queries or calling service methods which might fork git
processes. If those types of operations are required for the indexer to work, look for ways to do that processing in batches to reduce the overhead from repeatedly accessing the database or spawning new processes.
Commit indexers may be invoked concurrently, so they should avoid storing local state. Indexers are provided with a simple key/value store that can be used to store such state in a way that is associated with a specific indexing run. State stored in the IndexingContext
is not added to the CommitIndex
! To add new properties to the index, you must call CommitIndex.addProperty(String, String, String)
. Key/value pairs stored in the IndexingContext
are discarded when the indexing run completes.
See the CommitIndexer
documentation for more details.
Dos and Don'ts
- don't do expensive operations for every commit. It will slow down other indexers. If you must do expensive operations, consider offloading them to background threads. In this context any operation involving I/O is considered expensive.
- do handle seeing the same commit twice. Write your indexers in such a way that they can handle getting the same commit multiple times, both in the same repository and across different repositories.
- do consider memory usage. Your indexer may get called 100k times in a single indexing run. Make sure you don't keep lots of stuff in memory.
- do consider if bulk processing is more efficient than processing each commit individually. For example, consider recording added/removed hashes and doing bulk processing in the
onAfterIndexing
callback, or scheduling additional bulk processing in theofAfterIndexing
callback.
- do consider permissions. The indexer gets all commits for all repositories. What you do with it is up to you, but consider that when you present the information in the UI, the user may not have permission to see the commit in any repository.
Configuration
The root element for the Commit Indexer plugin module is <commit-indexer/>
. It allows the following attributes for configuration:
Attributes
Name | Required | Description | Default |
---|---|---|---|
key | The identifier of the plugin module. This key must be unique within the plugin where it is defined. | N/A | |
class | The fully qualified Java class name of the indexer. This class must implement CommitIndexer. | N/A |
Example
Here is an example atlassian-plugin.xml
file containing a single commit indexer:
x1<atlassian-plugin name="My Commit Indexer" key="example.plugin.indexer" plugins-version="2">
2<plugin-info>
3<description>My Commit Indexer Test</description>
4<vendor name="My Company" url="http://www.mycompany.com"/>
5<version>1.0</version>
6</plugin-info>
7
8<commit-indexer key="myCommitIndexer" class="com.mycompany.example.plugin.MyCommitIndexer"/>
9</atlassian-plugin>
And here's an example CommitIndexer
implementation that indexes whether commits have been committed by an author from Atlassian:
451public class MyCommitIndexer implements CommitIndexer {
2
3private final CommitIndex index;
4
5public MyCommitIndexer(CommitIndex index) {
6this.index = index;
7}
8
9
10
11public String getId() {
12// must be unique across *all* indexers
13return "atlassian-author-indexer";
14}
15
16
17public boolean isEnabledForRepository( Repository repository) {
18// allows conditionally indexing repositories
19return true;
20}
21
22
23public void onAfterIndexing( IndexingContext context) {
24// no-op, no tear down required
25}
26
27
28public void onBeforeIndexing( IndexingContext context) {
29// no-op, no setup required
30}
31
32
33public void onCommitAdded( Commit commit, IndexingContext context) {
34// set the 'byAtlassian' property if the author's email address ends in '@atlassian.com'
35String email = commit.getAuthor().getEmailAddress();
36if (email != null && email.endsWith("@atlassian.com")) {
37index.addProperty(commit.getId(), "byAtlassian", "true");
38}
39}
40
41
42public void onCommitRemoved( Commit commit, IndexingContext context) {
43// no-op, no clean up required - indexed properties are generally cleaned up automatically
44}
45}
Related module descriptors
Usually the methods on CommitIndex
and CommitService
are sufficient for dealing with properties. However, the Commit Property Config Plugin Module can be used to automatically decorate commits returned by Bitbucket Server's Java API and REST API with properties indexed by your plugin.