Hacker News | csnweb's favorites
1. Show HN: OpenWorkers – Self-hosted Cloudflare workers in Rust (openworkers.com)
500 points by max_lt 12 days ago | 158 comments
2. Valdi – A cross-platform UI framework that delivers native performance (github.com/snapchat)
534 points by yehiaabdelm 66 days ago | 225 comments
3. Production RAG: what I learned from processing 5M+ documents (abdellatif.io)
551 points by tifa2up 85 days ago | 114 comments
4. Show HN: I invented a new generative model and got accepted to ICLR (discrete-distribution-networks.github.io)
656 points by diyer22 3 months ago | 91 comments
5. Matrix Core Programming on AMD GPUs (salykova.github.io)
116 points by skidrow 3 months ago | 5 comments
6. How attention sinks keep language models stable (hanlab.mit.edu)
219 points by pr337h4m 5 months ago | 36 comments
7. Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)
314 points by matt_d 6 months ago | 76 comments
8. Show HN: Canine – A Heroku alternative built on Kubernetes (github.com/czhu12)
320 points by czhu12 7 months ago | 123 comments
9. Quantum Computation Lecture Notes (2022) (math.mit.edu)
166 points by ibobev 7 months ago | 48 comments
10. Look Ma, No Bubbles: Designing a Low-Latency Megakernel for Llama-1B (stanford.edu)
236 points by ljosifov 7 months ago | 31 comments
11. An Almost Pointless Exercise in GPU Optimization (speechmatics.com)
87 points by atomlib 7 months ago | 3 comments
12. Launch HN: Better Auth (YC X25) – Authentication Framework for TypeScript
259 points by bekacru 7 months ago | 106 comments
13. 'I paid for the whole GPU, I am going to use the whole GPU' (modal.com)
154 points by mooreds 8 months ago | 45 comments
14. How to Write a Fast Matrix Multiplication from Scratch with Tensor Cores (2024) (alexarmbr.github.io)
147 points by skidrow 8 months ago | 17 comments
15. Stop syncing everything (sqlsync.dev)
656 points by neilk 9 months ago | 126 comments
16. Introduction to Deep Learning (CMU) (cmu.edu)
165 points by rzk 9 months ago | 27 comments
17. Beyond Diffusion: Inductive Moment Matching (lumalabs.ai)
202 points by outrun86 10 months ago | 31 comments
18. Sidekick: Local-first native macOS LLM app (github.com/johnbean393)
325 points by volemo 10 months ago | 92 comments
19. Probabilistic Artificial Intelligence (arxiv.org)
352 points by pavanto 10 months ago | 97 comments
20. DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling (github.com/deepseek-ai)
391 points by mfiguiere 10 months ago | 67 comments
21. SIMD < SIMT < SMT: Parallelism in Nvidia GPUs (2011) (yosefk.com)
138 points by shipp02 on June 10, 2024 | 37 comments
22. Static search trees: faster than binary search (curiouscoding.nl)
656 points by atombender on Jan 1, 2025 | 232 comments
23. Willow, Our Quantum Chip (blog.google)
1410 points by robflaherty on Dec 9, 2024 | 528 comments
24. Ian's Secure Shoelace Knot (fieggen.com)
44 points by walterbell on Nov 16, 2024 | 21 comments
25. Optimizing a WebGPU Matmul Kernel for 1 TFLOP (zanussbaum.substack.com)
172 points by zanussbaum on Nov 11, 2024 | 80 comments
26. Bit Twiddling Hacks (2009) (stanford.edu)
92 points by elpocko on Aug 23, 2024 | 25 comments
27. σ-GPTs: A new approach to autoregressive models (arxiv.org)
293 points by mehulashah on June 7, 2024 | 93 comments
28. Better RAG Results with Reciprocal Rank Fusion and Hybrid Search (assembled.com)
249 points by johnjwang on May 30, 2024 | 57 comments
29. Show HN: Boldly go where Gradient Descent has never gone before with DiscoGrad (github.com/discograd)
232 points by frankling_ on May 26, 2024 | 66 comments
30. GPUs Go Brrr (stanford.edu)
1104 points by nmstoker on May 12, 2024 | 263 comments