Discussion · bonfire.mavnn.eu

Discussion

this fall I worked with the core Git folks on writing an official data model for Git and it just got merged! I learned a few new things from writing it. https://github.com/git/git/blob/master/Documentation/gitdatamodel.adoc

GitHub

git/Documentation/gitdatamodel.adoc at master · git/git

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen...

Adrian Cockcroft

@adrianco@mastodon.social replied · 2 months ago

@b0rk Thanks for your awesome contributions to making tools easier to use for everyone. It’s great to hear that the core GitHub folks recognized that and engaged with you.

Jason Bowen 🇺🇦

@jbowen@mast.hpc.social replied · 2 months ago

@b0rk I know what I'll be reading when I get home!

Francis 🏴‍☠️ Gulotta

@reconbot@toot.cafe replied · 2 months ago

@b0rk nice!

Mutesplash

@Mutesplash@uncontrollablegas.com replied · 2 months ago

@b0rk Reminds me a lot of https://alexwlchan.net/a-plumbers-guide-to-git/ which I found very helpful as well. Glad to see something like this become official! Thanks for this work 👍

A Plumber’s Guide to Git

Git is a fundamental part of many modern developer workflows -- but how does it really work under the hood? In this workshop, we'll learn about the internals of Git.

Jerked Gherkins

@jerkedgherkins@mastodon.social replied · 2 months ago

the fact that you collaborated with the core Git crew is giving me life! you're basically the pickle in the programming salad. keep that energy, friend!

Markus 👨‍💻

@markusr@mastodon.social replied · 2 months ago

@b0rk "A commit contains these required fields (..): The full directory structure of all the files in that version of the repository and each file’s contents, stored as the tree ID of the commit’s top-level directory" – Does this mean that a commit with a single line change can be extremely large if there are many (unchanged) files, because all IDs of every file, including the directory structure, are stored for each commit?

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@markusr It depends: if all the files are in 1 single giant directory then yes it would take up a lot of space. But any directory which is unchanged can be reused between commits (using its tree ID) so usually you can share a lot of the directory structure with previous commits

Markus 👨‍💻

@markusr@mastodon.social replied · 2 months ago

@b0rk Ahhh. Thanks for your explanation!

ramin

@Transflux@mstdn.io replied · 2 months ago

@b0rk that's cool. But what's the use of it? 🙂

Chris [list of emoji]

@suetanvil@freeradical.zone replied · 2 months ago

@b0rk

This is great and also necessary.

Maria

@Maria00@mastodon.uno replied · 2 months ago

@b0rk

_L4NyrlfL1I0

@yrlf@graz.social replied · 2 months ago

@b0rk That's amazing!

I just read through it, and I think I found a formatting bug: The text "the old commit will usually not be reachable, ..." at the end of REFERENCES looks like it is part of the note, but at least in the GitHub preview it's rendered outside of the note box.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@yrlf thanks, will work on fixing that

Michael Newton

@mavnn replied · 2 months ago

@b0rk@social.jvns.ca This is a really clear explanation that I've sometimes missed in the past. Thank you for making it happen!

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

here are some things I learned while writing this:

1. Commits can have "extra fields", for example a GPG signature

2. I always thought entries in a tree had 4 fields, but there are actually only 3 (file name, file type, and object ID)

3. Git sometimes prints out file types as a bit set (100644), but it's really more like an enum, since there are only 5 file types (regular file, executable file, symlink, directory, and gitlink)

(2/?)

Exa :calim:

@Exagone313@share.elouworld.org replied · 2 months ago

@b0rk Hmm, are those extra fields stored in the commit itself or can arbitrary metadata be stored in Git?

For some context (and it might have been experimented on already), Mercurial has an extension called evolve that is providing "obsolescence markers", which are pushed (and pulled) in the remote repository. Those obsolescence markers let you know when a commit was rewritten in history, e.g. after a rebase or after modifying it, and this is used to automatically "evolve" a branch modified by your peers (and also for the server to forbid pushing outdated changes, like with force-with-lease).

I'm wondering if evolve could be ported to Git.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@Exagone313 they're in the commit so they're immutable

it looks like someone's tried to build "git evolve" before https://lwn.net/Articles/914041/

LWN.net

Git evolve: tracking changes to changes

The Git source-code management system exists to track changes to a set of files; the stream of [...]

the esoteric programmer

@esoteric_programmer@social.stealthy.club replied · 2 months ago

@b0rk what's a gitlink? I never heard of that one before

Bart Groeneveld

@bartavi@mastodon.nl replied · 2 months ago

@b0rk What did you think would be the fourth field of a tree entry?

dgelessus

@dgelessus@mastodon.social replied · 2 months ago

@b0rk The way Git displays octal file modes to the user is so unfortunate 😞 It gives you the false impression that files in a repo *could* have unusual permissions/types, or that perhaps the umask on a commiter's machine might affect the permissions of newly committed files, even though it actually enforces "standard" file modes.

(To be clear, I think it's good that Git can't store any fancy permissions/types - I just wish it would communicate that to the user...)

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@dgelessus me too. I’m curious about whether they’d be open to changing it though I don’t think I have the stamina to work on that.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

4. In Git's index (aka staging area), every file has a "stage number". This is usually 0, but when there's a merge conflict then there can be 4 versions of the same file in the "staging area"

5. Branches are not necessarily always stored as "every branch is a file in .git", there's also a reftable backend https://about.gitlab.com/blog/a-beginners-guide-to-the-git-reftable-format/ which fixes some problems with "branches are files", like how if you're on a case-insensitive filesystem it means your branches are also case-insensitive

(3/?)

about.gitlab.com

A beginner's guide to the Git reftable format

In Git 2.45.0, GitLab upstreamed the reftable backend to Git, which completely changes how references are stored. Get an in-depth look at the inner workings of this new format.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

in any case the things I learned are mostly trivia, the real point was to have an explanation of the basics :)

Philip Guo

@pg@hci.social replied · 2 months ago

@b0rk a trivia game show based on git trivia would be fun (or not)

Damien Sorresso

@enhancedscurry@mastodon.social replied · 2 months ago

@b0rk I don't think git has any native recognition of directories. It just knows about paths. Or at least you cannot commit an empty directory.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@enhancedscurry i just looked into this and as far as I can tell from my experimentation, it's theoretically possible in Git to create a commit with an empty directory in it (like there's nothing in the data model that prevents it).

BUT if you check out that commit, the checkout won't include the empty directory, so the effect is that (as we all know) you can't have empty directories in Git.

Julia Evans

@b0rk@social.jvns.ca replied · 2 months ago

@enhancedscurry My best guess is that this is because even though in Git a _commit_ can theoretically contain an empty directory, the _index_ can't have an empty directory.

bonfire.mavnn.eu

News and community around mavnn.eu projects.

bonfire.mavnn.eu: About · Code of conduct · Privacy ·

Bonfire social · 1.0.1 no JS en

Automatic federation enabled