All Notes: Nonlinear Function

All Notes

rubber-duck debugging

sacred and profane

SuccessfulFriend highlighted this distinction which I should really read more about. At a high level it's about the distinction between…

safe objective

Language is a really natural way to tell AI systems what we want them to do. Some current examples: [ GPT ]-3 and successors (InstructGPT…

salt

In chemistry, a salt is a neutral-ish (not too acidic nor basic) compound held together by an [ ionic bond ]. Salts can be formed by [ acid…

scheduled sampling

Scheduled sampling is a training procedure for sequence models that attempts to mitigate [ exposure bias ] - the problem in which generation…

score function

The score function is the gradient of a log-density with respect to its parameters: It is the direction that we would move the parameters…

score matching

Aapo Hyvärinen: Estimation of Non-Normalized Statistical Models by Score Matching (2005) https://jmlr.org/papers/volume6/hyvarinen05a…

self-aware

self-confidence

[ unearned confidence ] [ agency and confidence ]

self-love

selfishness is moral

Sometimes it's necessary and right to prioritize my own interests, even if [ global utility ] is ultimately the only metric. Developing…

selflessness is rational

I need to genuinely care about other people and want the best for them, both in general, and for specific people in my life. Why? Obviously…

sense gate

Traditional Buddhism describes six "sense bases" or gates: the eye, ear, nose, tongue, body, and mind. Western science usually omits the…

sense of self

When we talk about "the self", or having a "sense of self", what do we mean? There is an interpretation in terms of [ consciousness ] - that…

sense of the possible

Take the statement 'human-level AI is possible'. As a kid, I saw this as obviously true. We can simulate physics, and brains are physical…

serotonin

There are 14 kinds of serotonin receptors; most (but not all) are [ G protein ]-coupled. The central nervous system has almost all of them…

shadow

Shadow work means, roughly speaking, a practice of noticing, loving, and integrating the parts of yourself that you've repressed (your…

shard theory

Shard theory's basic ontology of RL holds that shards are contextually activated, behavior-steering computations in neural networks…

side channel

sigma-algebra

simulator AI

References: https://generative.ink/posts/simulators/ It seems pretty clear that the intelligence emerging from [ language model ]s is not…

single-index model

The performance of an investment can be modeled as where the 'market return' is that of some sufficiently broad index such as the S&P 50…

skinship

In Korea (and maybe also Japan?) it's common for young guys to bond through physical touch and affection: hugging, holding hands, sitting in…

sleep

It's weird that we lie down every day to cease our consciousness, and sometimes to hallucinate. There are physiological benefits to sleep…

small steps

It's not a terrible summation of [ depression ] that it starts from seeing no way to achieve your goals. Sometimes that's because it's…

soft actor-critic

software engineering

software lessons from TFP

I've been really unhappy about how TFP is developed. It's felt pedantic. I waste a lot of effort thinking about things I don't want to think…

southeast asia travel tips

This is the advice I wish I'd had. It's catered to my preferences; caveat emptor. Packing: (for men) bring one pair of long pants, for…

sparse coding

paper 1: http://redwood.berkeley.edu/bruno/papers/VR.pdf basic idea: find a basis such that any given image (or whatever signal) can be…

sparse mixture of experts

References: Jacobs, Jordan, Nowlan, Hinton. Adaptive Mixtures of Local Experts (1991) Shazeer et al. Outrageously Large Neural Networks…

speech becomes less free

I used to be able to say 'superintelligent AI is possible'. Now in industry the notion of 'possible' is 'something I can myself do': by…

spiritual joy

What would true wireheading feel like? People have this impression that it'd be thin, exhausting, artificial, ultimately isolating and not…

spiritual path

stablecoin

A stablecoin is a dollar-denominated liability registered on a [ blockchain ]. It can be backed by USD reserves, as Tether allegedly is…

starting principle

Julian Shapiro recommends keeping a set of six 'Starting principles' that you use to make decisions. That's about all that you can…

state values, then action values

A common pattern in [ reinforcement learning ] pedagogy is to develop some idea first in the context of estimating state values , and then…

stationary

A [ stochastic process ] is (strictly) stationary if all of its joint distributions are invariant under time displacement. It is wide…

staying up late

I've always been an evening person more than a [ morning person ]. I often stay up until 1 or 2am, and in the absence of hard constraints it…

steering language models

Getting language models to align their output with human preferences would be highly useful for [ computational life coach ]ing. What's the…

stochastic differential equation

SDEs are typically written in terms of the differential of a Weiner process (Brownian motion), e.g., Although Weiner processes are nowhere…

stochastic gradient

stochastic process

A stochastic process is a collection of [ random variable ]s defined on a common [ probability space ] . Equivalently, it is a joint…

stoicism

Pasting a quote from Adam Smith by way of HN (source http://www.econlib.org/library/Smith/smMS7.html , I should read the whole thing…) that…

stopping time

A stopping time for a stochastic process is a time-valued That is, integer-valued for discrete-time processes and real-valued for…

strange loops

strong opinion weakly held

structural equation model

structural motive

A lot of confused discussion around large organizations comes from conflating individual motivations with larger-scale 'structural…

structured prediction

In kindergarten stats, you learn how to build a model that takes in data (a feature vector, image, sound file, etc) and predicts a single…

stupid ideas are good ideas

Revolutionary ideas must live in the blind spots of the current intellectual conversation; otherwise people would already be using them…

style guide

Note naming The general goal is to minimize the use of aliasing in links. In case where these guidelines suggest an unnatural or uncommon…

substantive questions I've had

substantive questions I've had these are things I've wondered about that were never answered properly in the classes in which I learned them…

substituted tryptamines

Substituted tryptamines - PsychonautWiki Tryptamine consists of an [ indole ] moeity plus a two-carbon (ethyl) chain with an amine group. We…

suffering

sufficient statistic

sugar

A sugar is any molecule with the empirical formula C(N)H(2N)O(N). These are like alkanes, which are C(N)H(2N + 2), except that each carbon…

superposition

A -dimensional vector can represent distinct orthogonal features, but due to the weirdness of [ high-dimension ]al geometry, it can…

surprises in having a job

What have I learned in 2.5 years at Google? What did I not realize? The model of research. How low the expectations are. How fake it felt to…

sybil attack

symmetry theory of valence

Refs: https://opentheory.net/Qualia_Formalism_and_a_Symmetry_Theory_of_Valence.pdf

syncing supernote with surface pro x

My Supernote A5X syncs through Dropbox, but unfortunately Dropbox doesn't support Windows ARM64 machines like the Surface Pro X. Here's my…

taṇhā

Buddhist (Pali) term referring to craving, longing, desire for the world to be other than as it is. This includes craving good things and…

talk about people

There's a famous quote attributed to Eleanor Roosevelt: "great minds discuss ideas, average ones events, mediocre ones discuss people". This…

tantra

A set of methods for maintaining an " attitude of spacious passion ". The particular methods are contingent; if you could maintain the…

target network

A general issue with [ temporal difference ] learning methods, which 'update a guess towards a guess', is that they can end up 'chasing…

teacher forcing

Something that confused for me for a while is that people in certain communities talk about 'teacher forcing' as though it's a trick or a…

teaching

Dave's principles of effective teaching. Motivation is by far the most important thing. A student who wants to learn will learn even with a…

teaching at the critical point

As a researcher, I wonder if there's a 'critical point' of growing an idea when it's important to be [ teaching ] it, whether formally or…

teaching lessons learned

working with Sinclair, Klein, Abbeel, they’ve all got great experience and advice especially for large classes You don’t have to give the…

teaching machine learning

Rob wants to firm up his foundations. He wants to understand relevant stats, probabilistic models, inference, and maybe work our way up to…

television is useless

Epistemic status: either this is true or TV is maybe one of the greatest contributions to human utility ever. Unclear. The average American…

temporal difference

From David Silver's slides : TD-learning 'updates a guess towards a guess'. Sutton and Barto define the temporal difference error as the…

ten-year goals

tension

This page (first brainstormed in an Otter note) is for issues where I feel pulled in several directions. Different principles seem to yield…

tensor

Every in machine learning talks about tensors, but no one really understands what they are. This page collects several definitions and…

tensor product

The tensor product of two vector spaces (defined on the same scalar field, we'll assume ) is the vector space of formal sums of…

terry tao on statistical mechanics

This post gives a nice, mathematically clear development of basic terms in statistical mechanics. Highlights: Think of a physical system as…

testable prediction

thai ingredients

thai holy basil / hot basil thai basil turmeric rice noodles: thin (pad thai) or wide (pad see ew / pad kee mao) oyster sauce, fish sauce…

the appropriate kind of suffering

the balanced-utility trap

Status: in conflict with [ negative utility ] ? See also: the [ hedonic treadmill ]. Evolutionary, 'pain' exists to motivate you to get out…

the best things have many stories

I used to think that there was a 'best' way to motivate an area. For example, in VI, the ELBO is derived from the KL divergence between a…

the buddha solved his problem, now solve yours

NameRedacted often mentions this as a thing Sharon Salzberg would say. What does it mean? I don't know - I should ask Tucker to clarify…

the dance

It's almost never worth worrying about whether an individual action is the right thing to do. It's like trying to dance while worrying at…

the discourse is wrong

In order for a group of people, like an academic field, or a political elite, to meaningfully converse about a complex topic, they have to…

the map is not the territory

Metaphor connected to the observation that [ all models are wrong ]. Borges, On Exactitude in Science : ...In that Empire, the Art of…

the mind contains the world

A point made by NameRedacted in various talks, e.g. The World is Inside You (also the '[ emptiness ] of perception' described by [ Dan…

the mistake is upstream

Comparing myself to SuccessfulFriend, I might be tempered to think that because he is interested in antitrust law, zoning reform, political…

the null hypothesis is always wrong

Andrew Gelman believes that in certain areas of research , like the social sciences, everything is connected. "I’m not expressing…

the privilege of advice working out

In every field, there is a store of 'standard' advice that is handed down from mentors to ambitious youngsters. In computer science grad…

the purpose of life

The Feynmannian/Sagan/Tyson "scientific" view is that the [ purpose ] of life is understanding : the world is a giant mystery, with layers…

the self is a construct

It exists, but is [ empty ], insubstantial, a [ fabrication ]. Foregrounding this view is an important part of [ awakening ] or…

the system is bad

[ things are deeply wrong ]

the system wants you to have ownership

If I'm managing someone, I want them to be coming up with their own ideas and owning them. Owning their ideas means they will themselves…

theory of intelligence

tl;dr : the ideas we need to build intelligent systems may be different from those we need to understand them. Both are important, but…

theory of the case

Several ideas here: When I try to tell a story about what I'd like to change about my life, at a high level, I can come at it from different…

therapy

there's never a single cause

For several reasons: multiple object-level causes a telescoping tower of causes at increasing levels of generality or abstraction 'because…

there are no paradoxes, just bad models

If two statements that both seem true conflict with each other, then it seems like you have a paradox. But the world itself is just as it is…

there is no speed limit

theses are great sources

Pointed out in this tweet: https://twitter.com/AmandaAskell/status/1311776280128479238 but also in many other places over the years…

things I believe that no one else believes

AI is going to work. Obviously lots of people believe this. But most 'AI' companies and 'AI' investors are hyping applications of current…

things I will always do

No matter what other priorities or any incredibly important goals arise in my life, whether through work, family, or other circumstances…

things I would like to do

Write Write regularly: under routine circumstances, at least a few minutes per day. This could be filling in nodes of this graph, blogging…

things are deeply wrong

See also: [ the system is bad ] I find it hard to be okay with a 'normal' life, because that would imply some level of acceptance of the…

things school should teach

things that are always productive

These might not be the best thing to do at any point, but they're better than doing nothing. And doing them can create a sense of progress…

things to build

See also [ writing inbox ] See also [ ongoing projects ] See, first and foremost, the backlinks below. Crypto trading model. Write a system…

this is all there is

This is one of those things that sounds cliche but is still profound and #fundamental: this is all. There's no great reward in the future…

thought vector

thoughts about kids

I want kids, eventually. I want to be able to talk with them, to build a relationship, to see the world through someone else's eyes. I want…

thoughts are actions

The [ agent ] model of intelligence imposes a sharp distinction between the agent and its environment, where the agent 'chooses' actions…

thoughts on multivariate causalimpact

let's say the signal we see after the intervention is modeled as the combination of the counterfactual forecast and an intervention effect…

three characteristics

[ impermanence ] [ dukkha ] (unsatisfactoriness) [ no-self|annita ] (no-self) Daniel Ingram's summary: things "come and go, don't satisfy…

three questions

(fellow student) Smitha has these post-its on her desk: what are you doing? why is it important? are you making progress? I think these…

tissue paper thin

This is personal mental image for [ emptiness ] that has been really resonant for me, arising from an experience taking [ MDMA ] with a…

to a first-time employee

It helps a lot to write down the things you think someone should know about working in a new environment. Even if a new person would figure…

to those whom much is given, much is expected

I feel an obligation to try to do big things with my life, because I've had access to rare opportunities. If ten thousand randomly selected…

to watch

Television: Arcane ken burns on the vietnam war WandaVision For All Mankind Severance: https://m.imdb.com/title/tt11280740/ Borgen Diplomat…

tokenize

How should a machine learning model represent text? Word-level and character-level features are obvious options, but both have drawbacks…

tool AI

Sometimes mentioned as a potential approach to [ AI safety ]. Gwern: Why Tool AIs want to be Agent AIs (roughly: because treating…

toolformer

Notes on Toolformer: Language Models Can Teach Themselves to Use Tools The basic method is: "Given just a handful of human-written examples…

trace

Trace of a Linear Operator We define the trace as the sum of diagonal elements of a matrix: Lemma : If and are square, then . Proof…

tractable approximations to utilitarianism

There are three main approaches to moral philosophy: [ utilitarian ]ism: you should feed a starving person because it will increase 'global…

training for consistency

These days we think a lot about using data to train large [ language model ]s. But there's only so much data in the world; eventually we'll…

training researchers

I didn't have a good intuitive understanding of the social landscape of being a researcher (and joining a [ research community ]). When…

transactions are positive-sum

If you and I agree of our own volition to exchange X for Y, this implies that we both believe we are gaining value in the trade. If one of…

transformer

The core of the transformer architecture is multi-headed [ attention ]. The transformer block consists of a multi-headed attention layer…

transformer parallelization math

What does the computational profile of a transformer vs a similar RNN look like? First, the transformer. Let's take the LLama 6.7B model…

transformer primatives

In developing intuition about [ transformer ]s it's useful to think about specific primitive operations that can be implemented by a small…

transformers with memory

Incorporating explicit memory and retrieval seems pretty clearly like the next frontier in language modeling and AI more broadly. We have…

transposes are measures

According to this reddit post , one of the main takeaways of functional analysis is that the right way to interpret the 'transpose' of a…

trapped priors

SSC link: How general is this phenomenon? You have a belief Your belief colors your perception of something that doesn't inherently…

trituration

true but wrong

A pitfall with relying too heavily on rational deduction is that lots of logically 'true' conclusions are unimportant, or worse yet…

trust region policy optimization

(notes loosely based on the Berkeley deep RL course lecture ) Setup: RL with policy gradients The basic setup is that we want to optimize…

truth is a low bar

Language is an incredible bottleneck. There are infinitely many true facts about the world, even just in pure math, and yet we communicate…

trying new things in the bedroom

The reason to try new things is not really because the new things themselves are more exciting than the old ones. The reason is that it…

tryptamine

tryptophan

type 2 decisions

From Jeff Bezos' 1997 shareholder letter : Some decisions are consequential and irreversible or nearly irreversible – one-way doors – and…

type theory

Inspired by Kevin Buzzard's overview of the state of automatic theorem provers. Type theory is like set theory in that sets and types are…

unconditional love

nostalgebraist argues that unconditional love can't and shouldn't exist : A parent might love their child "unconditionally," in the well…

unearned confidence

Confidence is a positive [ feedback loop ]. You need a certain amount of confidence to try something hard. Success breeds more confidence…

union bound

It's a basic law of probability that, given two events A and B, the probability that at least one of them occurs is given by This counts the…

unique contribution

(this note expresses a tendency that I notice in myself. I don't necessarily endorse this tendency but I think it's interesting to…

universal basic hedonism

universal consciousness

universal suffering

From a review by [ Oliver Burkeman ] of Jordan Peterson's "Beyond Order" ( https://www.theguardian.com/books/2021/mar/02/beyond-order-by…

unsupervised pretraining

useful lens

like a 'useful perpective', but 'lens' implies focus or distortion whereas 'perspective' implies linear projection. Related to [ many models…

useful reading

Consuming unstructured content from the internet is addictive. Twitter is full of life advice, interesting technical discussion, takes on…

utilitarian

value aligned language game

Suppose I have an agent that generates text. I want it to generate text that is [ value alignment|aligned ] with human values. Approaches…

value alignment

value in stating the obvious

value learning

Notes on the Alignment Forum's Value Learning sequence curated by Rohin Shah. ambitious value learning : the idea of learning 'the human…

values all the way down

The standard [ Markov decision process ] formalism includes a reward function ; the total (discounted) reward across a trajectory is its…

variational inference

How should people do VI? One ultimate goal is that you write a Stan model (or better, a model with discrete variables, but one step at a…

variational optimization

Holy shit. In December on Galiano I was brainstorming about [ continuous structure learning ] and thought of the general trick, for…

vegetarian

Inspired by NameRedacted, I'm considering going 'mostly' vegetarian. What would that mean for me? I don't myself buy meats or dairy products…

vision for my garden

Why am I doing all of this? If I carve aside hours or days or months to 'fill in' my graph of notes, what am I hoping to get from it? Why is…

vision transformer

Ref: https://arxiv.org/abs/2010.11929 We start by chunking an image into patches, and concatenating each patch with a position embedding…

vulnerable

Telling people about your failures, your fears, your self-doubt, your insecurities can be a path towards deeper connection. Understanding…

warmth

How to be warm: https://www.youtube.com/watch?v=1MolmoFuXu4&t=123s

weak ties

The "strength of weak ties": most good things in life come from people you barely know. This is because your close, regular connections are…

wealth tax

Thinking through: Why the toughest capitalists should root for a wealth tax ( https://www.ft.com/content/e1adf707-b95a-4422-9211-1841cd7ce…

web3

Moxie Marlinspike on web3: https://moxie.org/2022/01/07/web3-first-impressions.html We know that people do not want to run their own…

weekly review

[ weekly review ] • Plus: What went well? • Minus: What didn't go so well? • Next: What will I focus on next week?

weighted importance sampling

Reference: Mahmood et al., 2014. Weighted importance sampling for off-policy learning with linear function approximation Here's a situation…

what I am doing wrong

I suspect many of these are evergreen. I'm not [ writing ] enough. I'm not keeping up a regular journaling practice.

what I have lost

Just like norms in the Trump administration, there are mental habits, rhythms of life, attitudes towards the world, that are powerfully…

what to say

In the course of any person's life, you take in a vast amount of information. You have your own personal experiences, of course, and you…

what to teach students

See also: [ if ever a prof ], [ advice for college students ] Things not directly related to course material that I wish I'd learned earlier…

when I quit

What will I do when I don't have a job? I don't feel that I have a clear direction. I want to learn and explore. There are lots of [ my…

why would you ever let your mind get like that

A story from [ Dan Brown ]: A group of psychologists came to interview the Dalai Lama, the spiritual leader of Tibet. One of the Americans…

winning the game

As a kid, we learned about https://en.wikipedia.org/wiki/The_Game_(mind_game) : if you think of the game, you lose. (and have to say "I…

wisdom I've acquired

From 2017: wisdom I've acquired: the psychology of depression. :-( and grad school. :-( and being gay. [ dual-process cognition ] theory…

work quotes

“It was true that I didn’t have much ambition, but there ought to be a place for people without ambition, I mean a better place than the one…

world model

worldly objective

This may be a central point of confusion: how do we define AI systems that have preferences about the real world , so that their goals and…

write libraries, not frameworks

In software: a library is a collection of tools. You can use some or all of them, in combination with other tools. A framework , on the…

write up

your writing needs to be at the edge of your knowledge, it needs to address the most fascinating people you know or can imagine. That is…

writing

Quote I like from Manuel Blum's advice to grad students , connecting writing to the power of [ Turing machine ]s: STUDYING: You are all…

writing a project proposal

What is the philosophy of the project? What principles is it betting on? Example from Ben's Ads doc: iterating on an end-to-end pipeline…

writing habits

Regular writing practices that would be valuable. [ prediction as a model-building exercise ]

writing inbox

wrong models in AI

The models we use in AI are [ all models are wrong|wrong ] (if maybe still useful). How? Agency The [ agent ] model assumes a separation of…

yaas is the inauthentic yes

status: a theory that feels true for my personal trajectory. Totally uncritiqued and unverified that anyone else shares this experience…

you are the sum of the people you spend time around

you can learn everything

Sometimes it's daunting how much knowledge there is in the world. For any given area, there are a thousand specialties and subspecialties…

your network matters

Something that SuccessfulFriend said today: It's rare that someone totally independent comes up with a really good idea. The best ideas come…

zero knowledge

A zero-knowledge proof allows a prover to demonstrate that it possesses certain information, without revealing that information to the…

zk-SNARK

A zk-SNARK, or zero knowledge Succinct Non-interactive Argument of Knowledge, is a [ zero knowledge ] proof system that is non-interactive…