csnweb's favorites | Hacker News

1.		Show HN: OpenWorkers – Self-hosted Cloudflare workers in Rust (openworkers.com)
		500 points by max_lt 12 days ago \| 158 comments
2.		Valdi – A cross-platform UI framework that delivers native performance (github.com/snapchat)
		534 points by yehiaabdelm 66 days ago \| 225 comments
3.		Production RAG: what I learned from processing 5M+ documents (abdellatif.io)
		551 points by tifa2up 85 days ago \| 114 comments
4.		Show HN: I invented a new generative model and got accepted to ICLR (discrete-distribution-networks.github.io)
		656 points by diyer22 3 months ago \| 91 comments
5.		Matrix Core Programming on AMD GPUs (salykova.github.io)
		116 points by skidrow 3 months ago \| 5 comments
6.		How attention sinks keep language models stable (hanlab.mit.edu)
		219 points by pr337h4m 5 months ago \| 36 comments
7.		Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)
		314 points by matt_d 6 months ago \| 76 comments
8.		Show HN: Canine – A Heroku alternative built on Kubernetes (github.com/czhu12)
		320 points by czhu12 7 months ago \| 123 comments
9.		Quantum Computation Lecture Notes (2022) (math.mit.edu)
		166 points by ibobev 7 months ago \| 48 comments
10.		Look Ma, No Bubbles: Designing a Low-Latency Megakernel for Llama-1B (stanford.edu)
		236 points by ljosifov 7 months ago \| 31 comments
11.		An Almost Pointless Exercise in GPU Optimization (speechmatics.com)
		87 points by atomlib 7 months ago \| 3 comments
12.		Launch HN: Better Auth (YC X25) – Authentication Framework for TypeScript
		259 points by bekacru 7 months ago \| 106 comments
13.		'I paid for the whole GPU, I am going to use the whole GPU' (modal.com)
		154 points by mooreds 8 months ago \| 45 comments
14.		How to Write a Fast Matrix Multiplication from Scratch with Tensor Cores (2024) (alexarmbr.github.io)
		147 points by skidrow 8 months ago \| 17 comments
15.		Stop syncing everything (sqlsync.dev)
		656 points by neilk 9 months ago \| 126 comments
16.		Introduction to Deep Learning (CMU) (cmu.edu)
		165 points by rzk 9 months ago \| 27 comments
17.		Beyond Diffusion: Inductive Moment Matching (lumalabs.ai)
		202 points by outrun86 10 months ago \| 31 comments
18.		Sidekick: Local-first native macOS LLM app (github.com/johnbean393)
		325 points by volemo 10 months ago \| 92 comments
19.		Probabilistic Artificial Intelligence (arxiv.org)
		352 points by pavanto 10 months ago \| 97 comments
20.		DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling (github.com/deepseek-ai)
		391 points by mfiguiere 10 months ago \| 67 comments
21.		SIMD < SIMT < SMT: Parallelism in Nvidia GPUs (2011) (yosefk.com)
		138 points by shipp02 on June 10, 2024 \| 37 comments
22.		Static search trees: faster than binary search (curiouscoding.nl)
		656 points by atombender on Jan 1, 2025 \| 232 comments
23.		Willow, Our Quantum Chip (blog.google)
		1410 points by robflaherty on Dec 9, 2024 \| 528 comments
24.		Ian's Secure Shoelace Knot (fieggen.com)
		44 points by walterbell on Nov 16, 2024 \| 21 comments
25.		Optimizing a WebGPU Matmul Kernel for 1 TFLOP (zanussbaum.substack.com)
		172 points by zanussbaum on Nov 11, 2024 \| 80 comments
26.		Bit Twiddling Hacks (2009) (stanford.edu)
		92 points by elpocko on Aug 23, 2024 \| 25 comments
27.		σ-GPTs: A new approach to autoregressive models (arxiv.org)
		293 points by mehulashah on June 7, 2024 \| 93 comments
28.		Better RAG Results with Reciprocal Rank Fusion and Hybrid Search (assembled.com)
		249 points by johnjwang on May 30, 2024 \| 57 comments
29.		Show HN: Boldly go where Gradient Descent has never gone before with DiscoGrad (github.com/discograd)
		232 points by frankling_ on May 26, 2024 \| 66 comments
30.		GPUs Go Brrr (stanford.edu)
		1104 points by nmstoker on May 12, 2024 \| 263 comments
		More