christians site
some topics i have spent time on, am spending time on, or hope to spend time
on include:
-
data efficient finetuning, methods to produce quality synthetic and
partially synthetic data, environments that simulate real world envs
- post training, sft/rl/hierarchical curriculums
- separate vs unified omni modality architectures
- audio stuff, multitask audio learning, efficient speech inference
- fast inference, io/hardware aware algos
-
kerneltune
- training models to write performant triton kernels, dataset and model
published
-
audio-llama
- can language models understand audio without audio pretraining
-
realtime omni
- omni/multimodal inference server with qwen 2.5 omni or phi 4 multimodal.
text-to-text, text-to-speech, audio-to-text, audio-text-to-text,
image-text-to-text, audio-text-to-speech
-
learning implicit heuristics
- on-call agent environment with probability based synthetic env state
generation
github
directory