streaming: rely on object sources to create object stream - git.git

diff options

author	Patrick Steinhardt <ps@pks.im>	2025-11-23 19:59:37 +0100
committer	Junio C Hamano <gitster@pobox.com>	2025-11-23 12:56:45 -0800
commit	4c89d31494bff4bde6079a0e0821f1437e37d07b (patch)
tree	2aecc5b20716f47ba37dbf596727d324e6dabbf9 /entry.c
parent	385e18810f10ec0ce0a266d25da4e1878c8ce15a (diff)
download	git-4c89d31494bff4bde6079a0e0821f1437e37d07b.tar.gz

streaming: rely on object sources to create object stream

When creating an object stream we first look up the object info and, if it's present, we call into the respective backend that contains the object to create a new stream for it. This has the consequence that, for loose object source, we basically iterate through the object sources twice: we first discover that the file exists as a loose object in the first place by iterating through all sources. And, once we have discovered it, we again walk through all sources to try and map the object. The same issue will eventually also surface once the packfile store becomes per-object-source. Furthermore, it feels rather pointless to first look up the object only to then try and read it. Refactor the logic to be centered around sources instead. Instead of first reading the object, we immediately ask the source to create the object stream for us. If the object exists we get stream, otherwise we'll try the next source. Like this we only have to iterate through sources once. But even more importantly, this change also helps us to make the whole logic pluggable. The object read stream subsystem does not need to be aware of the different source backends anymore, but eventually it'll only have to call the source's callback function. Note that at the current point in time we aren't fully there yet: - The packfile store still sits on the object database level and is thus agnostic of the sources. - We still have to call into both the packfile store and the loose object source. But both of these issues will soon be addressed. This refactoring results in a slight change to semantics: previously, it was `odb_read_object_info_extended()` that picked the source for us, and it would have favored packed (non-deltified) objects over loose objects. And while we still favor packed over loose objects for a single source with the new logic, we'll now favor a loose object from an earlier source over a packed object from a later source. Ultimately this shouldn't matter though: the stream doesn't indicate to the caller which source it is from and whether it was created from a packed or loose object, so such details are opaque to the caller. And other than that we should be able to assume that two objects with the same object ID should refer to the same content, so the streamed data would be the same, too. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>

Diffstat (limited to 'entry.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: