Research Assistant — University of Alberta

Devon
Yanitski

I study what's latent in minds and language models — psychometrics from embeddings, real-time EEG, and LLM interpretability tools.

Department of Psychology, University of AlbertaComputational Memory Lab

Pillar 01 · Steering & structure

What's latent in language models

Opening up language models — sparse-autoencoder feature steering, and evolving vectors through embedding space to predict human behaviour.

loading the rat…

A Qwen3.5-2B with a sparse-autoencoder “Rat Brain” dial — drag it up and watch one steered SAE feature (#26631) take over the model’s personality. An open-source, rat-themed take on Golden Gate Claude.

sparse autoencodersfeature steeringQwen3.5-2BRats

GitHub →

Paperprivate

Devisal of the Fittest: Using Genetic Programming to Estimate Lexical Likely Need

Chris Westbury, Devon Yanitski, Anna Brooker

Genetic programming evolves 'average word' vectors in embedding space to predict lexical-decision and naming reaction times — operationalizing the cognitive notion of 'likely need'.

genetic programmingembeddingslexical decisionRT modeling

Submitted to Behavior Research Methods

Pillar 02 · Meaning as co-occurrence

Psychometrics from embeddings

Can the geometry of an embedding space stand in for human survey responses? Semantic Factor Analysis recovers latent psychological structure from item text alone.

Tool

semanticfa

Devon Yanitski

An R package for response-free semantic analysis of psychometric scales — a full toolkit that recovers and refines a scale's latent structure from item wording alone. Factor analysis on item embeddings is the centerpiece, alongside semantic similarity matrices, factor retention, semantic 'loadings', theory comparison, redundancy / jingle-jangle checks, and short-form building. No human response data required.

Rfactor analysisembeddingspsychometrics

GitHub

Big Five · embedding space

drag to rotate

loading embedding space…

50 Big-Five items embedded by Qwen3-Embedding-8B (4096-d), projected to 3D with UMAP on cosine geometry. Items cluster by factor — meaning recovered from wording alone.

Pillar 03 · Behavioural probes

Do LLMs replicate human behaviour?

Do language models choose like utilitarians, and does reasoning shift their judgements the way it shifts ours? Measuring when LLMs replicate human behaviour.

Paperprivate

Reasoning in LLMs Causes More Utilitarian Judgements

Devon Yanitski

Does the human dual-process finding — that reasoning increases utilitarian moral judgment — extend to LLMs? Across 122 open-source models run on a 40-item moral-dilemma battery, switching on reasoning raised utilitarian responding and lowered altruism and deontology: the human dual-process signature, reproduced in a system with no affect, kin, or evolved morality.

moral psychologyreasoning122 LLMsprocess dissociation

Paperprivate

LLMs validate the Cognitive Reflection Test

Devon Yanitski

122 open-source LLMs took an updated Cognitive Reflection Test (Meyer et al., 2024), each item answered twice — once on intuition, once with reasoning. Holding arithmetic ability fixed, engaging reasoning alone moves the score, supporting the claim that the CRT is separable from ability — a cleaner demonstration than human data, where CRT scores confound ability, disposition, and motivation.

cognitive reflection testreasoning122 LLMsMeyer 2024

Pillar 04 · Neural signals

Reading the live brain

Real-time EEG tooling — Muse Visualizer turns the raw waveforms and oscillatory band activity from a consumer headband into highly customizable, artistic live representations of the brain.

Live EEG · pianoopen ↗

loading clip…

Playing the Council of Elrond theme (LOTR) on piano while my live brain activity renders in Muse Visualizer.

Tool

Muse Visualizer

Ultra-high-performance (200+ FPS) real-time EEG visualization for the Muse headband — raw waveforms and Delta/Theta/Alpha/Beta/Gamma band power over Lab Streaming Layer.

PyQtGraphLSLsignal processingPPG

GitHub

Pillar 05 · AI-assisted research

Improving AI-assisted research

I am building the scaffolding that lets AI agents carry out autonomous psychology research — starting with how they ground themselves in the literature.

Tool

AP-RAG

A major roadblock preventing AI agents from performing autonomous research are hallucinations — when an LLM lacks the full context of an area of research, it often makes a plausible yet incorrect guess. However, it isn’t possible to dump all relevant prior research into an LLM's limited context window. An academic-paper retrieval-augmented generation (AP-RAG) system solves both problems: it lets AI agents efficiently retrieve the relevant chunks of information from a large body of research.

graph RAGacademic papersretrieval

GitHub

Context-window budget

Number of papers10Tokens / paper30,000

overhead 10,000papers 300,000overflow +60,000

A full paper runs ~30k tokens, so a 250k-token context fits only about 8 whole papers before it overflows. You can’t just paste the library into the prompt — that’s why a RAG system retrieves the few passages that actually matter. AP-RAG chunks academic papers into passages of 300–800 tokens, letting it synthesize information from hundreds of papers in a single context window.

Retrieval modes

Different questions need different evidence — pinning down a single paper isn’t the same as surveying a whole field — so AP-RAG offers several retrieval modes.

Combine entity- and relation-centric retrieval, merge the results, then recover the supporting chunks through the graph.

entities_vdb

relationships_vdb

chunks_vdb

knowledge graph

source chunks

answer model

Best for: Most everyday research questions.

Also — built for fun

Tools & toys

Tool

Slurm Status Bar

A VS Code extension showing real-time Slurm job status and fairshare trends across HPC clusters in the status bar and a rich dashboard.

TypeScriptVS CodeSLURM

GitHub

Game

Txtr

A Temple-Run-inspired typing game about texting and driving — dodge five lanes of traffic with the arrow keys while typing scripted replies. Vanilla HTML/CSS/JS on a canvas.

JavaScriptcanvastyping game

GitHub

Tool