Lightnews — Scholar-powered news

Alexander Shlyapin

@alshlyapin.bsky.social

This is why, for now, I'll stick with ChatGPT and Copilot. When Claude 3 Opus acquires internet search capabilities, I will try it.

6/6

March 7, 2024 at 11:42 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Two different models (you don't know which ones) give you answers, and you choose which answer is best. Claude 3 Opus is right behind GPT-4.

5/n

March 7, 2024 at 11:42 PM

Alexander Shlyapin

@alshlyapin.bsky.social

2) I think that arena.lmsys.org (click "Leaderboard" in the menu at the top) shows fairer results. The leaderboard is set up as follows: in the arena ("Arena (battle)" in the top menu, try it), you give a prompt.

4/n

March 7, 2024 at 11:41 PM
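
The blind pairwise voting described in posts 4/n and 5/n can be turned into a ranking in several ways. The posts do not say how the leaderboard computes its scores, so the following is only a minimal sketch that assumes an Elo-style update. The model names, the starting rating of 1000, and the K-factor of 32 are all hypothetical.

```python
def expected_score(rating_a, rating_b):
    """Probability that A beats B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(ratings, winner, loser, k=32.0):
    """Shift rating points from the loser to the winner after one vote."""
    exp_win = expected_score(ratings[winner], ratings[loser])
    delta = k * (1.0 - exp_win)  # an upset win moves more points
    ratings[winner] += delta
    ratings[loser] -= delta
    return ratings


# Hypothetical anonymous battles: each tuple is (preferred answer, other answer).
ratings = {"model_a": 1000.0, "model_b": 1000.0}
votes = [("model_a", "model_b"), ("model_a", "model_b"), ("model_b", "model_a")]
for winner, loser in votes:
    update_elo(ratings, winner, loser)

# Highest-rated model first.
leaderboard = sorted(ratings, key=ratings.get, reverse=True)
print(leaderboard, ratings)
```

Because the voter does not know which model produced which answer, each vote reflects only the answers themselves, which is the reason the posts give for trusting this kind of leaderboard more.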

Alexander Shlyapin

@alshlyapin.bsky.social

If you ask an LLM without internet access about recent events, it will just refuse to answer. However, I want to note that it's possible to add internet search to Claude 3. For example, Perplexity.ai managed to do it, as they did with Claude 2.1.

3/n

March 7, 2024 at 11:41 PM

Alexander Shlyapin

@alshlyapin.bsky.social

1) GPT-4 has access to the internet. I use chatbots every day (ChatGPT Plus and Microsoft Copilot Pro, which are based on GPT-4), and the ability to search the internet is absolutely essential for me.

2/n

March 7, 2024 at 11:40 PM

Alexander Shlyapin

@alshlyapin.bsky.social

But in the end, if the model really works as well as they claim, it will be a seminal work.

8/8

March 5, 2024 at 4:15 AM

Alexander Shlyapin

@alshlyapin.bsky.social

- Also, some people claim that they didn't cite two previous important works: arxiv.org/abs/1602.02830 (binarized NNs) and arxiv.org/abs/1609.00222 (ternary NNs).
- And most importantly, the code is not available yet, so we can't be sure it's really that good.

7/n

March 5, 2024 at 4:14 AM

Alexander Shlyapin

@alshlyapin.bsky.social

- As far as I understand, they still store gradients and the optimizer in high precision, so the difference in size during training is not that big (not 2.71 times at least). I took this information from "BitNet: Scaling 1-bit Transformers for Large Language Models."

6/n

March 5, 2024 at 4:14 AM
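
The point in post 6/n can be checked with back-of-the-envelope arithmetic. This is a rough sketch under assumptions that are not in the thread: 16-bit gradients and two 32-bit Adam moment buffers per parameter (64 optimizer bits). Real training setups differ, and the function name `bytes_per_param` is made up for illustration.

```python
import math


def bytes_per_param(weight_bits, grad_bits=16, optimizer_bits=64):
    """Memory per parameter during training, in bytes (assumed layout)."""
    return (weight_bits + grad_bits + optimizer_bits) / 8


fp16 = bytes_per_param(16)               # 16-bit weights
ternary = bytes_per_param(math.log2(3))  # ~1.58-bit weights

# If only the stored weights shrink, training memory barely changes...
training_ratio = fp16 / ternary
# ...whereas for weights alone (inference) the saving is about tenfold.
inference_ratio = 16 / math.log2(3)

print(f"training: {training_ratio:.2f}x, weights only: {inference_ratio:.2f}x")
```

Under these assumptions, gradients and optimizer state dominate the per-parameter cost, so shrinking the weights alone saves little during training.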

Alexander Shlyapin

@alshlyapin.bsky.social

- The paper does not provide detailed comparisons of hyperparameters, which could impact the performance evaluation between BitNet and LLaMA.

5/n

March 5, 2024 at 4:14 AM

Alexander Shlyapin

@alshlyapin.bsky.social

- The datasets used for training both models (BitNet and LLaMA) are reported to be the same, which helps ensure a fair comparison.
- The architectures are different, but I didn't delve into the details, so I can't say whether it's significant.

4/n

March 5, 2024 at 4:13 AM

Alexander Shlyapin

@alshlyapin.bsky.social

They compare it with LLaMA. Also, given Microsoft's involvement, the research likely adheres to high standards of quality and rigor. I quickly checked the paper to find something that would show the results are embellished but didn't find anything suspicious:

3/n

March 5, 2024 at 4:13 AM

Alexander Shlyapin

@alshlyapin.bsky.social

They managed to reduce every weight in the model to 1.58 bits (activations are 8-bit), whereas weights are normally 16 bits. They claim their model matches the 16-bit models in both perplexity and end-task performance while being faster.

2/n

March 5, 2024 at 4:13 AM
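
The 1.58-bit figure in post 2/n is the information content of a value with three possible states: log2(3) ≈ 1.58. The thread does not spell out the quantizer, so the sketch below assumes a simple "absmean" scheme: scale by the mean absolute weight, round, and clip to {-1, 0, +1}. The function name and the example weights are hypothetical.

```python
import math

# Three states per weight carry log2(3) bits of information.
bits_per_weight = math.log2(3)


def absmean_ternary(weights, eps=1e-8):
    """Map float weights to {-1, 0, +1} using the mean absolute value as scale."""
    scale = sum(abs(w) for w in weights) / len(weights)
    quantized = [max(-1, min(1, round(w / (scale + eps)))) for w in weights]
    return quantized, scale


weights = [0.9, -0.05, 0.4, -1.2, 0.02]  # made-up example values
quantized, scale = absmean_ternary(weights)

print(f"{bits_per_weight:.2f} bits per weight")
print(quantized)  # small weights collapse to 0, large ones to +1 or -1
```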

Alexander Shlyapin

@alshlyapin.bsky.social

Moreover, you receive no shares in this company, and the total sum donated by various people amounts to $130 million. I can’t even comprehend how this is legal.

3/3

March 1, 2024 at 10:39 PM

Alexander Shlyapin

@alshlyapin.bsky.social

Imagine you donated money to a charity for children with cancer, and then the organization shifted from non-profit to for-profit, essentially abandoning their mission to help these children.

2/n

March 1, 2024 at 10:39 PM

Alexander Shlyapin

@alshlyapin.bsky.social

In the end, everything remains as it is, and I remain an LLM engineer.

5/5

February 26, 2024 at 11:09 PM

Alexander Shlyapin

@alshlyapin.bsky.social

I've been thinking about this lately but haven't yet figured out how to change my life considering this new information. Should I go into coal mining? Become a cleaner? Work in a factory? Become a waiter? It doesn't seem like a logical solution.

4/n

February 26, 2024 at 11:08 PM

Alexander Shlyapin

@alshlyapin.bsky.social

It turns out that a programmer's job is easier for AI than a taxi driver's job. This got me thinking. I always believed that a programmer's job (and other intellectual work) would be replaced after non-intellectual work, but it turns out to be the opposite.

3/n

February 26, 2024 at 11:07 PM

Alexander Shlyapin

@alshlyapin.bsky.social

For example, ChatGPT can already code at a junior level and write texts at a professional level, whereas autonomous vehicles are not yet fully developed (although Waymo has already launched a taxi service in San Francisco).

2/n

February 26, 2024 at 11:05 PM
