Lightnews — Scholar-powered news

Jay Patel

@infotainment.bsky.social

2.4K followers 4.2K following 2.4K posts

🎷 vibe adulting

#HCI #PeerReview #SciPub
#toolsforthought #ResearchSynthesis
#OpenScience #MetaSci #FoSci

🔎 Research: ethnography of peer review
🧑‍🏫 Teaching: Stats, DataViz

🐢 UMD: College of Info
🌐 PhD Candidate: Info Studies / HCI + Data
🏝️ OASISlab

Posts Replies Media Videos

Jay Patel

@infotainment.bsky.social

Hands down, the best PubPeer comment I've read in a while.

screenshot of a pithy PubPeer comment about a user, Deiratonotus cristatus mentioning issues with the title, abstract, article, and reviewers of a PNAS paper on longevity research in mice

November 10, 2025 at 11:12 PM

Jay Patel

@infotainment.bsky.social

While reading a data science conference paper today, I noticed an interesting transparency statement.

Is this common to data mining conferences?

I like this sort of statement; it reminds me a bit of the 21-word solution. #metascience

screenshot of transparency statement/pledge by study co-authors

November 7, 2025 at 2:47 AM

Jay Patel

@infotainment.bsky.social

And just over the past day or so, even more commentary point to conflict of interest issues (didn't declare a key patent) and issues with changing the clinical trial registration dates to make it seem as if the data were collected after declaring the study plan/registration.

screenshot of PubPeer comments like Ioana A Cristea and Thomas Kesteman pointing to issues with clinical trial registration dates and an undeclared conflict of interest

November 5, 2025 at 9:55 PM

Jay Patel

@infotainment.bsky.social

Hey, @gracewade.bsky.social why is this article continuously posted on BSky? The number of issues documented by academic researchers is worth reporting.

The story should be one of errors/potential fraud instead of a breakthrough, right?

pubpeer.com/publications...

Author reply in the thread:

screenshot of PubPeer comment by the co-author of the stem cell paper (Armin Attar). Attar thanks the commenters and mentions that they found inconsistences that will be examined over the next 2-3 weeks.

Attars thanks the community on PubPeer.

November 5, 2025 at 9:19 PM

Jay Patel

@infotainment.bsky.social

So how did you prompt it then? Typically, prompts starting with "You are an expert X with expertise in domains A,B, and C..." is effective.

The Google white paper that was published a while ago can be helpful: cloud.google.com/discover/wha...

Section: Strategies for writing better prompts

screenshot of a guide on Google Cloud to write better prompts for LLM input

November 4, 2025 at 8:22 PM

Jay Patel

@infotainment.bsky.social

I did both in my master's thesis under supervision by a mentor. What are the stats on reporting this? Would be nice to know.

apastyle.apa.org/jars/quant-t...

APA JARS-QUANT reporting guidelines mention diagnostics:

APA JARS-QUANT reporting guidelines for statistics and data analysis mention reporting regression diagnostics.

November 4, 2025 at 7:25 PM

Jay Patel

@infotainment.bsky.social

Dear Reviewer 2: Go F’ Yourself.

Another gem in the #peerreview literature, a joke paper, finds that it's Reviewer #3 who's the real problem.

The paper even has a credulous PubPeer comment!

Paper: doi:10.1111/ssqu.12824s
PubPeer: pubpeer.com/publications/80F9ACFE1DC2E6510A4CC3D2D841C1

screenshot of PubPeer comment for the article Dear Reviewer 2: Go F'Yourself

October 17, 2025 at 11:58 PM

Jay Patel

@infotainment.bsky.social

Yes! Left is a blank PubPeer page where I submitted a comment (awaiting moderation, then never accepted). Right is Paperstars with the same comment.

pubpeer page with a blank space where my post should be

paperstars review of a paper on assessing novelty with LLMs, review is posted in full

October 1, 2025 at 5:30 PM

Jay Patel

@infotainment.bsky.social

But did you get a photo?

September 4, 2025 at 6:07 PM

Jay Patel

@infotainment.bsky.social

www.lequin.co.uk/blog/when-wo...

matrix chart of country by direct vs indirect negative feedback and low context vs high context communication styles show differences even within continents

August 17, 2025 at 7:29 PM

Jay Patel

@infotainment.bsky.social

Fourth find: Disclaimers abound. Might as well place them in reporting guidelines for standard communication given how popular they are.

screenshot of text with a disclaimer that LLMs shouldn't replace human reviewers

August 15, 2025 at 1:16 AM

Jay Patel

@infotainment.bsky.social

Round 3, folks! This time in red text at the bottom of the first page.

August 14, 2025 at 1:27 AM

Jay Patel

@infotainment.bsky.social

Yes, the first screenshot is about OpenReviewer. I read that paper recently and was able to run the HuggingFace demo: huggingface.co/spaces/maxid...

Maybe try again?

The second screenshot is Liang et al. 2024: ai.nejm.org/doi/abs/10.1...

August 13, 2025 at 9:46 PM

Jay Patel

@infotainment.bsky.social

BINGO! I call BINGO! How many variants can I find?

August 13, 2025 at 3:48 AM

Jay Patel

@infotainment.bsky.social

"Can Large Language Models" returns 8k+ hits on Google Scholar.

"Should Large Language Models" returns 73 hits.

Before asking can...?, ask should...? and you'll save yourself a year's worth of research in some cases.

August 13, 2025 at 2:25 AM

Jay Patel

@infotainment.bsky.social

AI researchers love to add disclaimers about the importance of humans in research activities, but I don't see much use for this kind of thing in practice.

Those who use their tools will do so as they like.
Disclaimers won't matter much in the long-run.

August 12, 2025 at 9:05 PM

Jay Patel

@infotainment.bsky.social

"This is the first study..." in a paper makes me fume. 😤
On Google Scholar, it returns almost 2 million hits.

"This is the second study..." returns 3,840 hits.
That's a difference of ~520X.

I'm more likely to believe the latter claim.
📖 If you make a novelty claim, then back it up.

July 29, 2025 at 11:44 PM

Jay Patel

@infotainment.bsky.social

This is how you enforce reporting guidelines: @neuripsconf.bsky.social does it right.

❌ Desk reject failure to comply

Which other venues do this sort of thing? #metascience

July 29, 2025 at 12:50 AM

Jay Patel

@infotainment.bsky.social

My new favorite motto and insignia for slow and open (aka slowpen) science:

"Festina lente"
(Latin translation: Make haste slowly)

Image of a cherub on a tortoise moving slowly with surrounding cherubs, circular shape on the ceiling of the Pallazo Vecchio

July 23, 2025 at 5:58 PM

Jay Patel

@infotainment.bsky.social

If you want to run a study on LLMs' abilities, please prompt engineer thoroughly.

This is the laziest and most honest method I've seen in my review so far:

"Whether this could have influenced the results remains currently unknown... Prompt designing is also time-consuming..."

July 22, 2025 at 12:04 AM

Jay Patel

@infotainment.bsky.social

3. Sorting and filtering by sentiment would be nice on the X/Bluesky pages.

screenshot of Altmetrics page with X posts highlight and cards below with user posts plus sentiment information

July 11, 2025 at 10:13 PM

Jay Patel

@infotainment.bsky.social

2. The colored squares indicating sentiment for the X and Bluesky tabs could be more prominent. I missed them for the first minute or two on the page.

screenshot of a card in Altmetric Explorer showing a bluesky post by Professor Elizabeth McKay retweeting a critical post about a mental health study

July 11, 2025 at 10:13 PM

Jay Patel

@infotainment.bsky.social

@altmetric.com The sentiment analysis feature is wonderful for researchers.

A few thoughts from my recent use to consider:

1. Can I view a feed of papers/posts by sentiment category (e.g. only papers with post > 10% negative)? That'd be useful to find problematic papers.

Sentiment analysis stacked bar chart from Altmetric Explorer showing mostly negative posts by users on X, Bluesky, etc.

July 11, 2025 at 10:13 PM

Jay Patel

@infotainment.bsky.social

Who watches the AI agent benchmarks?

❌ 7/10 contain shortcuts or impossible tasks.

❌ 7/10 fail outcome validity.

❌ 8/10 fail to disclose known issues.

preprint: arxiv.org/abs/2507.02825
blog: ddkang.substack.com/p/ai-agent-b...

set of three bar graphs showing that 10 popular AI agent benchmarks fail on task validity, outcome validity, and benchmark reporting

July 11, 2025 at 9:18 PM

Jay Patel

@infotainment.bsky.social

Behold! Genius-level product design:

#Grok search options:

screenshot of Grok 4, buttons to select Deepsearch or DeeperSearch

July 10, 2025 at 10:00 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news