-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
This creates a simple specialized object allocator for basic objects. This avoids wasting space with malloc overhead (metadata and extra alignment), since the specialized allocator knows the alignment, and that objects, once allocated, are never freed. It also allows us to track some basic statistics about object allocations. For example, for the mozilla import, it shows object usage as follows: blobs: 627629 (14710 kB) trees: 1119035 (34969 kB) commits: 196423 (8440 kB) tags: 1336 (46 kB) and the simpler allocator shaves off about 2.5% off the memory footprint off a "git-rev-list --all --objects", and is a bit faster too. [ Side note: this concludes the series of "save memory in object storage". The thing is, there simply isn't much more to be saved on the objects. Doing "git-rev-list --all --objects" on the mozilla archive has a final total RSS of 131498 pages for me: that's about 513MB. Of that, the object overhead is now just 56MB, the rest is going somewhere else (put another way: the fact that this patch shaves off 2.5% of the total memory overhead, considering that objects are now not much more than 10% of the total shows how big the wasted space really was: this makes object allocations much more memory- and time-efficient). I haven't looked at where the rest is, but I suspect the bulk of it is just the pack-file loading. It may be that we should pack the tree objects separately from the blob objects: for git-rev-list --objects, we don't actually ever need to even look at the blobs, but since trees and blobs are interspersed in the pack-file, we end up not being dense in the tree accesses, so we end up looking at more pages than we strictly need to. So with a 535MB pack-file, it's entirely possible - even likely - that most of the remaining RSS is just the mmap of the pack-file itself. We don't need to map in _all_ of it, but we do end up mapping a fair amount. ] Signed-off-by: Linus Torvalds <[email protected]> Signed-off-by: Junio C Hamano <[email protected]>
- Loading branch information
Linus Torvalds
authored and
Junio C Hamano
committed
Jun 20, 2006
1 parent
64e86c5
commit 855419f
Showing
7 changed files
with
67 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,51 @@ | ||
/* | ||
* alloc.c - specialized allocator for internal objects | ||
* | ||
* Copyright (C) 2006 Linus Torvalds | ||
* | ||
* The standard malloc/free wastes too much space for objects, partly because | ||
* it maintains all the allocation infrastructure (which isn't needed, since | ||
* we never free an object descriptor anyway), but even more because it ends | ||
* up with maximal alignment because it doesn't know what the object alignment | ||
* for the new allocation is. | ||
*/ | ||
#include "cache.h" | ||
#include "object.h" | ||
#include "blob.h" | ||
#include "tree.h" | ||
#include "commit.h" | ||
#include "tag.h" | ||
|
||
#define BLOCKING 1024 | ||
|
||
#define DEFINE_ALLOCATOR(name) \ | ||
static unsigned int name##_allocs; \ | ||
struct name *alloc_##name##_node(void) \ | ||
{ \ | ||
static int nr; \ | ||
static struct name *block; \ | ||
\ | ||
if (!nr) { \ | ||
nr = BLOCKING; \ | ||
block = xcalloc(BLOCKING, sizeof(struct name)); \ | ||
} \ | ||
nr--; \ | ||
name##_allocs++; \ | ||
return block++; \ | ||
} | ||
|
||
DEFINE_ALLOCATOR(blob) | ||
DEFINE_ALLOCATOR(tree) | ||
DEFINE_ALLOCATOR(commit) | ||
DEFINE_ALLOCATOR(tag) | ||
|
||
#define REPORT(name) \ | ||
fprintf(stderr, "%10s: %8u (%zu kB)\n", #name, name##_allocs, name##_allocs*sizeof(struct name) >> 10) | ||
|
||
void alloc_report(void) | ||
{ | ||
REPORT(blob); | ||
REPORT(tree); | ||
REPORT(commit); | ||
REPORT(tag); | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters