4 Philosophies of Interpretability

A talk I gave to my MATS 8.0 training program laying out what I view as the main philosophies and approaches to doing interpretability research, the pros and cons, and the different perspectives they give on standards of evidence and how one might approach a problem.