Lightnews — Scholar-powered news

Konstantinos Kitsios @kitsios.bsky.social · 9d

To mitigate this drop, we propose and evaluate the use of contrastive learning, which naturally ranks similarity between objects, thus enabling more effective semantic clone detection in-the-wild.

🙏 Huge thanks to my amazing co-authors, Francesco Sovrano, Earl T. Barr, and @sback.it. [4/4]
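
A minimal sketch of why a contrastive objective lends itself to ranking. The thread does not say which loss the paper uses, so InfoNCE is an assumed stand-in here. The two-dimensional "embeddings" are hand-written toy values, and `cosine`, `info_nce`, and `rank_by_similarity` are hypothetical helper names.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(anchor, positive, negatives, temperature=0.1):
    # InfoNCE loss (assumed objective): low when the positive is closer
    # to the anchor than every negative.
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))

def rank_by_similarity(anchor, candidates):
    # Order candidate names from most to least similar to the anchor.
    return sorted(candidates, key=lambda k: cosine(anchor, candidates[k]), reverse=True)

# Toy embeddings (made up for illustration, not model outputs).
anchor = [1.0, 0.0]
clone = [0.9, 0.1]
non_clone = [0.0, 1.0]

loss_good = info_nce(anchor, clone, [non_clone])
loss_bad = info_nce(anchor, non_clone, [clone])
ranking = rank_by_similarity(anchor, {"clone": clone, "non_clone": non_clone})
```

Because the loss only compares similarities, the learned similarity score can be used directly to rank candidates, rather than forcing a yes/no decision tied to functionalities seen in training.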

Konstantinos Kitsios @kitsios.bsky.social · 9d

By evaluating six models on clones of unseen functionality, we observe a significant performance drop for models explicitly trained for clone detection. For general-purpose LLMs, the drop is smaller but still present. [3/4]

Konstantinos Kitsios @kitsios.bsky.social · 9d

SOTA clone detection models are trained on clones of specific functionalities and tested on different clones of the same functionalities. But in practice, developers need to identify clones of functionalities the models have not been trained on. How well do models perform in such scenarios? [2/4]
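
The evaluation setting described in this post can be sketched as a data split in which held-out functionalities never appear in training. The `ClonePair` record, the functionality labels, the code strings, and `split_by_functionality` are hypothetical and not taken from the paper.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ClonePair:
    functionality: str  # hypothetical label: the task both snippets solve
    code_a: str
    code_b: str

def split_by_functionality(pairs, held_out):
    # Pairs whose functionality is held out go to the test set only,
    # so the model is evaluated on functionality it has never seen.
    held_out = set(held_out)
    train_set = [p for p in pairs if p.functionality not in held_out]
    test_set = [p for p in pairs if p.functionality in held_out]
    return train_set, test_set

pairs = [
    ClonePair("sort", "sorted(xs)", "xs.sort()"),
    ClonePair("sort", "bubble(xs)", "merge_sort(xs)"),
    ClonePair("gcd", "gcd_iter(a, b)", "gcd_rec(a, b)"),
]
train_set, test_set = split_by_functionality(pairs, held_out={"gcd"})
```

A conventional split would instead place different `sort` pairs in both sets, which is the same-functionality setting the post contrasts against.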

Konstantinos Kitsios @kitsios.bsky.social · 9d

🧩 Can semantic code clone detectors really detect clones in-the-wild?

🎉 We address this question in our paper “Detecting Semantic Clones of Unseen Functionality,” recently accepted to @aseconf.bsky.social!

📄 Pre-print: arxiv.org/abs/2510.04143
💻 Code: github.com/kitsiosk/uns...
[1/4]

Reposted by Konstantinos Kitsios

Proton @proton.me · 29d

Europe’s digital backbone is built on foreign rails.

In 🇩🇪 DE (58%), 🇦🇹 AT (59%), 🇧🇪 BE (80%), 🇮🇹 IT (69%), 🇱🇺 LU (78%), and 🇳🇱 NL (81%), publicly listed companies rely on US email, and the wider stack behind it.

Read the full study for additional details. 👇

Europe’s tech sovereignty watch | Proton for Business

Europe’s biggest businesses run on US tech — putting its privacy and sovereignty at risk. Read our study on how bad the problem is and why we urgently need a Europe-first tech policy.

proton.me

Reposted by Konstantinos Kitsios

daniel:// stenberg:// @bagder.mastodon.social.ap.brid.gy · Sep 8

My keynote from Open Source Summit Europe 2025 is now up. 13 pretty packed minutes.

https://youtu.be/YEBBPj7pIKo?si=DBxSCFuqkFQBRdOw

Reposted by Konstantinos Kitsios

Haskell programming language @haskell.org · Sep 4

Maybe listening to Greek ρεμπέτικο (rebetiko) while programming will fill our souls with meaning that LLMs can never bring us.

Konstantinos Kitsios @kitsios.bsky.social · Sep 3

🙏 Many thanks to Marco Castelluccio for the great collaboration, and to @sback.it for his invaluable mentorship during this work.

📄 Preprint: arxiv.org/abs/2509.01616
💻 Code: github.com/kitsiosk/blast

#ASE2025

5/5

Konstantinos Kitsios @kitsios.bsky.social · Sep 3

We deployed BLAST in three open-source repositories from @mozilla.org, where it proposed 11 fail-to-pass tests to the developers, 6 of which were confirmed to reproduce the designated issue. This calls for scrutiny towards the widely used fail-to-pass metric, which we discuss in detail. 4/5

Konstantinos Kitsios @kitsios.bsky.social · Sep 3

BLAST generates such fail-to-pass tests in 151 out of 426 (35.4%) issue-patch pairs from a widely used benchmark, outperforming state-of-the-art approaches while requiring only 2 LLM queries and 1 minute of lightweight SBST generation. 3/5

Konstantinos Kitsios @kitsios.bsky.social · Sep 3

We introduce BLAST, a tool that combines LLMs and Search-Based Software Testing (SBST) to generate tests that fail before a patch and pass after. 2/5
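
The fail-to-pass criterion named in this post can be illustrated with a small sketch. This is not BLAST's implementation, only the acceptance check. `buggy_mean`, `patched_mean`, the two candidate tests, and `is_fail_to_pass` are hypothetical names invented for the example.

```python
from typing import Callable, Sequence

def buggy_mean(xs: Sequence[float]) -> float:
    # Pre-patch behavior (hypothetical bug): divides by zero on empty input.
    return sum(xs) / len(xs)

def patched_mean(xs: Sequence[float]) -> float:
    # Post-patch behavior (hypothetical fix): empty input yields 0.0.
    return sum(xs) / len(xs) if xs else 0.0

def check_empty_input(mean: Callable[[Sequence[float]], float]) -> None:
    # Candidate test that targets the reported issue.
    assert mean([]) == 0.0

def check_nonempty_input(mean: Callable[[Sequence[float]], float]) -> None:
    # Candidate test that does not exercise the bug at all.
    assert mean([2.0, 4.0]) == 3.0

def passes(test, impl) -> bool:
    # A test passes if it raises nothing when run against the implementation.
    try:
        test(impl)
        return True
    except Exception:
        return False

def is_fail_to_pass(test, before, after) -> bool:
    # Fail-to-pass: fails on the pre-patch code and passes on the post-patch code.
    return (not passes(test, before)) and passes(test, after)
```

Here `check_empty_input` is fail-to-pass, while `check_nonempty_input` passes on both versions and so says nothing about the patch.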

Konstantinos Kitsios @kitsios.bsky.social · Sep 3

🐞 After a bug is patched, how can we increase our confidence that it will not reappear in the future?

We address this question in our paper recently accepted to @aseconf.bsky.social 2025! 🎉 1/5

Reposted by Konstantinos Kitsios

Gergely Orosz @gergely.pragmaticengineer.com · Mar 26

AI crawlers are wrecking the open internet.

My small side project - techpays .com - used to generate below 100GB of traffic per month. It’s on Render, where 500GB/month is included; above that it’s $30 per 100GB.

Meta’s AI crawler + other bots have pushed it to 700GB+ per month

WTH

Reposted by Konstantinos Kitsios

Mozilla @mozilla.org · Mar 24

Users deserve data protections. Sign our petition to help them get it. mzl.la/41YuAPG

Konstantinos Kitsios @kitsios.bsky.social · Mar 2

Already mentioned by someone else here, but is there any practical way a dev could support 🇺🇦?

Reposted by Konstantinos Kitsios

Cornelius Aschermann @is-eqv.bsky.social · Feb 4

aischolar.0x434b.dev Pretty cool project by @434b.bsky.social: A neat web interface to explore security (and in particular: Fuzzing) papers with AI summaries. Seems super useful to get/stay up to date with recent papers :)

AIScholar - Paper Database

aischolar.0x434b.dev

Reposted by Konstantinos Kitsios

Gergely Orosz @gergely.pragmaticengineer.com · Jan 29

As a software eng, it is inherently satisfying to see an open approach beat closed approaches in an innovative field.

Linux is open: Windows is closed

Llama, Deepseek, Mistral are open: OpenAI, Gemini, Anthropic & many others are closed

Closed approaches winning almost always leads to monopolies.

Konstantinos Kitsios @kitsios.bsky.social · Jan 23

Congrats!

Reposted by Konstantinos Kitsios

Sung Kim @sungkim.bsky.social · Jan 19

This is just a reminder that training on test data is all you need to achieve SOTA perf.

OpenAI had access to all of the FrontierMath data from the beginning, but they verbally agreed that the data would not be used in model training, although there was a legal agreement not to disclose the partnership.

Reposted by Konstantinos Kitsios

Daniel Lakens @lakens.bsky.social · Jan 11

Listened to the Telepathy Tapes. It is a great illustration of what you get when incompetent people try to do science. The show is actively misleading (and the makers know it) with purely political and financial goals (the rest is confirmation bias combined with incompetence).

Konstantinos Kitsios @kitsios.bsky.social · Jan 9

Wow, these are indeed top-notch researchers in the field, totally worth reading some of their work.

Konstantinos Kitsios @kitsios.bsky.social · Jan 6

Staubbach Falls in the Swiss Alps 🇨🇭

Waterfall coming down from a mountain hill

Reposted by Konstantinos Kitsios

Gergely Orosz @gergely.pragmaticengineer.com · Jan 5

What Dutch directness looks like. The CEO of ASML was pressed by US partners on how ASML supplying devices to China could enable, e.g., actions against the Uyghurs. He responded by asking how this is different from what gun manufacturers might enable!

From "Focus: The ASML Way" by Marc Hijink

15 31 280

Reposted by Konstantinos Kitsios

Daniel Lakens @lakens.bsky.social · Jan 4

The more simplistic the take you see on here, the more it generalizes, the more it is an act of politics, not science.

If the take comes from a scientist, they know they have no evidence for their take, or they would use it.

Konstantinos Kitsios @kitsios.bsky.social · Dec 26

Here is also a VM from scratch in Python instead of C, by Greg Wilson: third-bit.com/sdxpy/vm/
