Alexander Shlyapin
alshlyapin.bsky.social
Alexander Shlyapin
@alshlyapin.bsky.social
NLP Engineer (LLM)
This is why, for now, I'll stick with ChatGPT and Copilot. When Claude 3 Opus acquires internet search capabilities, I will try it.

6/6
March 7, 2024 at 11:42 PM
Two different models (you don't know which ones) give you answers, and you choose which answer is best. Claude 3 Opus is right behind GPT-4.

5/n
March 7, 2024 at 11:42 PM
2) I think that arena.lmsys.org (click "Leaderboard" in the menu at the top) shows more fair results. The leaderboard is set up as follows: in the arena ("Arena (battle)" in the top menu, try it), you give a prompt.

4/n
March 7, 2024 at 11:41 PM
If you ask an LLM without internet access about recent events, it will just refuse to answer. However, I want to note that it's possible to add internet search to Claude 3. For example, Perplexity.ai managed to do it, as they did with Claude 2.1.

3/n
March 7, 2024 at 11:41 PM
1) GPT-4 has access to the internet. I use chatbots every day (ChatGPT Plus and Microsoft Copilot Pro, which are based on GPT-4), and the ability to search the internet is absolutely essential for me.

2/n
March 7, 2024 at 11:40 PM
But in the end, if the model really works as well as they explain, it must be a new seminal work.

8/8
March 5, 2024 at 4:15 AM
- Also, some people claim that they didn't cite two previous important works: arxiv.org/abs/1602.02830 (binarized NNs) and arxiv.org/abs/1609.00222 (ternary NNs).
- And most importantly, the code is not available yet, so we can't be sure it's really that good.

7/n
March 5, 2024 at 4:14 AM
- As far as I understand, they still store gradients and the optimizer in high precision, so the difference in size during training is not that big (not 2.71 times at least). I took this information from "BitNet: Scaling 1-bit Transformers for Large Language Models."

6/n
March 5, 2024 at 4:14 AM
- The paper does not provide detailed comparisons of hyperparameters, which could impact the performance evaluation between BitNet and LLaMA.

5/n
March 5, 2024 at 4:14 AM
- The datasets used for training both models (BitNet and LLaMA) are reported to be the same, which helps ensure a fair comparison.
- The architectures are different, but I didn't delve into the details, so I can't say whether it's significant.

4/n
March 5, 2024 at 4:13 AM
They compare it with LLaMA. Also, given Microsoft's involvement, the research likely adheres to high standards of quality and rigor. I quickly checked the paper to find something that would show the results are embellished but didn't find anything suspicious:

3/n
March 5, 2024 at 4:13 AM
They managed to reduce every parameter in the model to 1.58 bits (except for activations, which are 8-bit), whereas normally it's 16 bits. They claim their model matches the 16-bit models in both perplexity and end-task performance while being faster.

2/n
March 5, 2024 at 4:13 AM
Moreover, you receive no shares in this company, and the total sum donated by various people amounts to $130 million dollars. I can’t even comprehend how this is legal.

3/3
March 1, 2024 at 10:39 PM
Imagine you donated money to a charity for children with cancer, and then the organization shifted from non-profit to for-profit, essentially abandoning their mission to help these children.

2/n
March 1, 2024 at 10:39 PM
In the end, everything remains as it is, and I stay as an LLM engineer.

5/5
February 26, 2024 at 11:09 PM
I've been thinking about this lately but haven't yet figured out how to change my life considering this new information. Should I go into coal mining? Become a cleaner? Work in a factory? Become a waiter? It doesn't seem like a logical solution.

4/n
February 26, 2024 at 11:08 PM
It turns out that a programmer's job is easier for AI than a taxi driver's job. This led me to think. I always believed that a programmer's job (and other intellectual work) would be replaced after non-intellectual work, but it turns out to be the opposite.

3/n
February 26, 2024 at 11:07 PM
For example, ChatGPT can already code at a junior level and write texts at a professional level, whereas autonomous vehicles are not yet fully developed (although Waymo has already launched a taxi service in San Francisco).

2/n
February 26, 2024 at 11:05 PM