Robert Zych
banner
zychr.bsky.social
Robert Zych
@zychr.bsky.social
Software Engineer @ Raft, Apache Pinot Contributor, lives in Sacramento area.
Pinned
For this demo I generated a 100TB data set (50GB/min) and maintained sub-second query latencies

teamraft.com/resources/in...
Secure & Low-Latency Queries at scale with Raft Data Platform - [R]DP - Raft | Operationalize Data & Agentic AI
Raft is a DefenseTech Product company on a mission to connect humans with data at the edge. One of our products is the Raft Data Platform -[R]DP. We’ve built the [R]DP and operationalized it in severa...
teamraft.com
March 8, 2025 at 8:49 PM
Reposted by Robert Zych
The JVM internals series - #5 is out..

This episode lays out a good foundation for us to go into the internals of each of the amazing GCs and their tradeoffs.

Become a member to access the upcoming internals episode.. Meanwhile watch this video (free access until tomorrow). youtu.be/p1Yamdly0QE
February 2, 2025 at 12:47 PM
With @apachepinot.bsky.social I can query a 100TB data set and using it’s Star-tree index can get 20ms query latency!
January 25, 2025 at 5:22 PM
@startreedata.bsky.social Thank you and Merry Christmas! My community box arrived last night and with my winnings from the MI competition I recently purchased a pair of skis. #startree
December 17, 2024 at 5:31 PM
@apachepinot.bsky.social It's a bit early for xmas, but thanks for all the features! youtu.be/gwcL7O0o6W4?...
Meetup: Apache Pinot Year in Review 2024
YouTube video by StarTree
youtu.be
December 7, 2024 at 12:00 AM
December 4, 2024 at 7:20 PM
Last night, I was exploring thread safety issues with ChatGPT. It only took two attempts to generate a test that exposed the lack of a synchronized block for a section that updates multiple member variables. #java
December 3, 2024 at 3:58 PM
What does this program output? #java
November 19, 2024 at 3:55 AM
Reposted by Robert Zych
90 days ago Bluesky had 6.18M users
30 days ago Bluesky had 10.85M users
Tomorrow Bluesky will have 15M users

This is absolutely wild, thanks for believing in us y'all
November 13, 2024 at 6:50 AM
In which cases is performance more important than correctness?

Approximations. Like Pinot’s DISTINCTCOUNTHLL which returns an approximate distinct count using HyperLogLog.

Does anyone else have a good example?
Approximations.like
November 5, 2024 at 2:48 AM
Working on JSON predicate pushdown for the Trino-Pinot connector:

contains((1,2,3), json_extract_scalar(json, '$.key`)) -> json_match(json, '$.key in (1,2,3)')

json_array_contains(json_extract(json, '$.array'), value) -> json_match(json, '$.array[*] = value')

json_match uses Pinot's JSON index
October 29, 2024 at 3:20 PM
It’s a good sign when a friend surprisingly welcomes you on your 1st day at a new job. Thanks @nbuesing.bsky.social !
August 8, 2023 at 2:28 AM