moomou

moomou https://paul.mou.dev/ Recent content on moomou Hugo -- gohugo.io en-us since 2017 Sun, 31 Dec 2023 00:00:00 +0000 Listening with LLM https://paul.mou.dev/posts/2023-12-31-listening-with-llm/ Sun, 31 Dec 2023 00:00:00 +0000 https://paul.mou.dev/posts/2023-12-31-listening-with-llm/ Overview This is the first part of many posts I am writing to consolidate learnings on how to finetune Large Language Models (LLMs) to process audio, with the eventual goal of being able to build and host a LLM able to describe human voices. Learning to Debug Gibberish https://paul.mou.dev/posts/2023-12-27-ohmy/ Wed, 27 Dec 2023 00:00:00 +0000 https://paul.mou.dev/posts/2023-12-27-ohmy/ Overview Recently, I went down a rabbit hole of debugging opensource LLM spewing gibberish on my PC. The investigation was the one of the most difficult (but interesting) debugging experience I have encountered so far. CORS with Cookie https://paul.mou.dev/posts/2022-08-13-cors-with-cookie/ Sat, 13 Aug 2022 00:00:00 +0000 https://paul.mou.dev/posts/2022-08-13-cors-with-cookie/ Introduction CORS stands for Cross-Origin-Resource-Sharing and is the HTTP mechanism to allow servers to accept requests from other host locations other than its own. Messing with Nvidia GPU on Headless Linux https://paul.mou.dev/posts/2021-12-29-nvidia-gpus/ Wed, 29 Dec 2021 00:00:00 +0000 https://paul.mou.dev/posts/2021-12-29-nvidia-gpus/ I have a PC with Ubuntu server installed and Nvidia GPUs attached. I have Googled on and off for a while trying to learn how to overclock the GPUs without success. Improvements to "m", a personal command line tool https://paul.mou.dev/posts/dev-ux2/ Sat, 22 Feb 2020 00:00:00 +0000 https://paul.mou.dev/posts/dev-ux2/ From m to m2 Having used m for many years now, I encountered two primary problems. The first is startup performance. Training a Speaker Embedding from Scratch with Triplet Learning https://paul.mou.dev/posts/speaker-embedding/ Sat, 05 May 2018 23:01:25 -0700 https://paul.mou.dev/posts/speaker-embedding/ Posted on go.mou.dev/triplet-embedding-learning ML paper notes https://paul.mou.dev/notes/ml_notes/ Fri, 01 Sep 2017 23:01:25 -0700 https://paul.mou.dev/notes/ml_notes/ 2017-09 LEARNING FINE-GRAINED IMAGE SIMILARITY WITH DEEP RANKING describes efficient sampling technique based on reservoir sampling for building triplets; requires an relevance function multi scale CNN DEEP METRIC LEARNING USING TRIPLET NETWORK learns a semantic embedding; results show better discrimination vs siamese network (contrastive loss function) MSE softmax shows improved performance rather than simple binary softmax (see paper for def) feed a triplet of x, x1, x2 where x1 is same class as x and x2 is different DISTILLING THE KNOWLEDGE IN A NEURAL NETWORK explores compression technique of ensemble model into a single model Distillation softmax qi = exp(zi/T)/Sigma(j)(exp(zj/T) where z are logits and T is temperature T is usually 1 increasing T creates softer probability distribution knowledge is tranferred via training smaller/compressed model by targeting over softer target (ie temperature T > 1) from more cumbersome model small model trained with higher T as well but in prediction mode uses T = 1 tranfer training can be improved by using datasets with true label demonstrate distillation with minist dataset - tranfer works well even when smaller model trained by omitting certain numbers discusses using soft distribution target technique for training specialists on very large datasets Google internal JFT data of 100M images Questions teacher - student model, relation to curriculum learning? Tools https://paul.mou.dev/notes/tools/ Fri, 01 Sep 2017 23:01:25 -0700 https://paul.mou.dev/notes/tools/ Fzf Amazing command line tool to fuzzy search for files. Cannot live without this. Also integrates with vim. https://github.com/junegunn/fzf Jq Swiss army knife for working with JSON on the command line Developer Experience https://paul.mou.dev/posts/dev-ux/ Tue, 22 Aug 2017 23:01:25 -0700 https://paul.mou.dev/posts/dev-ux/ Commandline Productivity and Automation (aka make it easy to repeat) Developers tend to repeat themselves. A lot. This can be as innocuous as running a test manually after you update a test file or as insidious as deploying a newly built binary into production manually.