João Abrantes
abranti.bsky.social
Emerging societies from scratch with kinship-aligned multi-agent RL 🇵🇹
- Rich game worlds: reproduce, gather, store, trade and steal resources
- Learning agents: starting with RNNs, scaling toward small transformers

I'm exploring this and documenting my findings (with code): abranti.com/the-key-prop...
November 4, 2025 at 6:00 PM
I like this name, but evolution is more general than biological evolution, and so is this framework

It could be memes being carried by brains

Companies maximising shareholders' value, etc

In my first paper I tried to make it general and replaced "gene" with "replicator".
November 4, 2025 at 9:51 AM
kinship is the proportion of genes shared between two agents.

since the goal of the agents is to protect and replicate their genes, agents' goal alignment is proportional to their kinship
November 4, 2025 at 9:49 AM
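
The kinship definition in the post above can be sketched in a few lines. This is only an illustration, assuming each agent's genome is a fixed-length sequence of gene identifiers; the function name `kinship` and that representation are hypothetical and not taken from the author's code.

```python
def kinship(genes_a, genes_b):
    """Proportion of gene positions at which two agents carry the same gene.

    Assumption (not from the source): a genome is a fixed-length sequence
    of gene identifiers, compared position by position.
    """
    if len(genes_a) != len(genes_b):
        raise ValueError("genomes must have the same length")
    if len(genes_a) == 0:
        raise ValueError("genomes must be non-empty")
    shared = sum(a == b for a, b in zip(genes_a, genes_b))
    return shared / len(genes_a)


# Two agents that share half of their genes.
print(kinship([1, 2, 3, 4], [1, 2, 9, 9]))
```

Under this reading, identical genomes give a kinship of 1 (perfectly aligned goals) and fully distinct genomes give 0.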
will work on that for future blog posts. I also want to open source more of my code so that it gets easier for others to contribute.
November 4, 2025 at 9:48 AM
I would be interested in:
1. Putting those ideas to the test and measuring the reduction in the Alignment Gap

2. Scaling the complexity of the environment (e.g. Age of Empires but without hardcoded property rights) and scaling the capabilities of the agents (e.g. using super tiny LLMs).
November 3, 2025 at 5:44 PM
There are many theories around which behaviours cause better cooperation outside of kin, but there are very few experiments that show the efficiency of each behaviour in complex envs.
November 3, 2025 at 5:44 PM
I'm calling this the Alignment Gap: the difference in the efficiency of resource usage between perfectly aligned agents and agents with some gene diversity

I'm interested in studying what characteristics the agents need to have to reduce this Alignment Gap.
November 3, 2025 at 5:44 PM
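
One way to make the Alignment Gap concrete is sketched below. It assumes, as a stand-in for "efficiency of resource usage", the population sustained per unit of available resource; that proxy and the names `resource_efficiency` and `alignment_gap` are hypothetical, not the author's definitions.

```python
def resource_efficiency(population_size, resources_available):
    """Hypothetical proxy: agents sustained per unit of resource."""
    if resources_available <= 0:
        raise ValueError("resources_available must be positive")
    return population_size / resources_available


def alignment_gap(aligned_population, diverse_population, resources_available):
    """Efficiency of perfectly aligned agents minus that of gene-diverse agents.

    Both populations are assumed to come from runs in the same environment
    with the same resource budget.
    """
    aligned = resource_efficiency(aligned_population, resources_available)
    diverse = resource_efficiency(diverse_population, resources_available)
    return aligned - diverse


# Illustrative numbers only, not experimental results.
print(alignment_gap(120, 90, 60))
```

A gap of zero would mean that gene-diverse agents use resources as efficiently as perfectly aligned ones.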
there are periods where the gene count (of every gene) decreases with optimisation. everyone gets worse off with more optimisation.

society gets stuck in sub-optimal places and the agents can't just all agree to go back to their previous behaviour where everyone was better off.
November 3, 2025 at 5:44 PM
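
The periods described in the post above could be located from logged gene counts with a check like the following. It is a sketch under the assumption that counts are recorded per gene at successive training checkpoints; the function name and data layout are hypothetical.

```python
def steps_where_all_genes_decline(gene_counts):
    """Return checkpoint indices at which every gene's count fell.

    gene_counts maps a gene identifier to its list of counts, one entry per
    training checkpoint (an assumed logging format, not the author's).
    """
    histories = list(gene_counts.values())
    if not histories:
        return []
    n = len(histories[0])
    if any(len(h) != n for h in histories):
        raise ValueError("all genes need the same number of checkpoints")
    # A step t qualifies only if each gene has fewer copies than at t - 1.
    return [t for t in range(1, n) if all(h[t] < h[t - 1] for h in histories)]


# Illustrative counts only: both genes lose copies between checkpoints 1 and 2.
counts = {"gene_a": [10, 12, 11, 9], "gene_b": [5, 6, 4, 5]}
print(steps_where_all_genes_decline(counts))
```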
when training perfectly aligned agents we see population size increasing monotonically, showing that resources are being used more efficiently.

however, this is not true when optimising agents with different genes.
November 3, 2025 at 5:44 PM
Humans share fewer genes, so they had to overcome trust issues with complex social behaviours: status hierarchies, reputation tracking, laws and contracts.

I am sharing my thoughts, experiments, code and results here: abranti.com/the-key-prop...
Emerging a Society with MARL Part 1: The Key Properties to Reproduce Society with Multi-Agent RL
We introduce an Environment that benefits agents who can cooperate — despite having different goals. We argue that these were the conditions that shaped human social capabilities.
abranti.com
October 30, 2025 at 9:49 AM
The core thesis is that massive cooperation is relatively easy between agents that carry the same genes. For example, our trillions of cells engage in massive collaboration to help us live and reproduce.
October 30, 2025 at 9:49 AM
thanks for sharing Marc!

if anyone would like to collaborate in this direction, do reach out!

@jzleibo.bsky.social @eaduenez.bsky.social @karltuyls.bsky.social
October 30, 2025 at 9:46 AM