Cyrus Samii – Page 2

January 18, 2022December 14, 2023

Methods for situating a scholar in their field

I am putting these notes here to remind me of steps and also in case others are curious about doing something similar.

Suppose we want to situate a scholar in their field, for example as part of a tenure review case. One way to do that is to look at the scholar’s papers and see who they are citing:

Go to their Google scholar profile and pull up their papers. Choose some of their most cited papers (reflecting how others see the scholar’s contributions) and some of their most recent papers (reflecting their current thinking).
Construct the network of people that the scholar references in their most prominent work.

A low-tech way to do this is to copy/paste bibliographies from the papers into https://anystyle.io/ to put the bibliographies into machine readable format (e.g., bibtex). I like to tag the entries from each paper’s bibliography by the date of the paper’s publication (e.g., by adding a custom field to the bibtex file) so that I can sort and see how the scholar’s reference base has changed over time. Compile the different bibliographies into a library in a reference manager. If you keep duplicate entries you get a sense of the scholar’s key points of reference.

A higher-tech way to do this is to use the Connected Papers app. You can look at the graph to find well-cited work that the scholar tends to reference.

UPDATE (12/14/23): The “InfluenceMap” project allows for creating an influence diagram (people that the scholar draws up, and then people who cite the scholar): [link]

Pare down the list to seminal contributions. E.g., keep only entries from relevant general interest and field journals that are highly cited.

Now some analyses:

First, who appears most often in the library? What does the work of these primary referents represent in the literature and how does the current scholar’s work relate?

Second, whose work is being referenced at different times over the course of the scholar’s career (I do this using the custom field described above)? What does this say about how the scholar’s work has evolved alongside the reference literature?

As far as I know, the steps above are not as well-automated as methods to see who else is citing the scholar’s work (there are numerous tools to do that, like the “scholar” package in R). Would love to see someone do it (and welcome any suggestions below).

May 18, 2021

Readings on statistical discrimination and inefficiency

A tweet by Sarah Jacobson prompted a few discussion threads on current perspectives on statistical discrimination and efficiency/inefficiency. Here is the original tweet:

I used to think economists used the idea of statistical discrimination to understand how discrimination could be defeated – by info vs some other means. But it seems like instead some think of it as a justification – a reason discrimination is OK. Sometimes economists bum me out.
— Sarah Jacobson (@SarahJacobsonEc) May 18, 2021

I have collected references to some of the papers that discussants mentioned as providing more refined takes on the original Arrow and Aigner-Cain analyses:

Lundberg, Shelly J., and Richard Startz. “Private discrimination and social intervention in competitive labor market.” The American Economic Review 73.3 (1983): 340-347.
Schwab, Stewart. “Is statistical discrimination efficient?.” The American Economic Review 76.1 (1986): 228-234.
Coate, Stephen, and Glenn C. Loury. “Will affirmative-action policies eliminate negative stereotypes?.” The American Economic Review (1993): 1220-1240.
Bohren, J. Aislinn, et al. Inaccurate statistical discrimination. No. w25935. National Bureau of Economic Research, 2019.
Lang, Kevin, and Ariella Kahn-Lang Spitzer. “Race discrimination: An economic perspective.” Journal of Economic Perspectives 34.2 (2020): 68-89.
Komiyama, Junpei, and Shunya Noda. “On Statistical Discrimination as a Failure of Social Learning: A Multi-Armed Bandit Approach.” arXiv preprint arXiv:2010.01079 (2020).
Fosgerau, Mogens and Sethi, Rajiv and Weibull, Jorgen W., Costly Screening and Categorical Inequality (April 21, 2021). Available at SSRN: https://ssrn.com/abstract=3533952 or http://dx.doi.org/10.2139/ssrn.3533952

October 27, 2020

Design-Based Inference for Spatial Experiments with Interference

Excited to share “Design-Based Inference for Spatial Experiments with Interference”, joint with Peter M. Aronow and Ye Wang: arxiv

In settings with complex spatial effects and interference, the paper defines a type of marginal effect, the “average marginalized response,” that has a clear interpretation and can be identified with a spatial experiment and a simple contrast.

It took time to work out details for robust inference, and finally got there with Ye working out reasonable conditions that justify the spatial HAC variance estimator, and then by connecting to a breakthrough CLT result from Ogburn et al. (2020; arxiv link).

We are working on the public release of the R package and also a more didactic paper that walks through applications. Stay tuned for those.

April 21, 2020April 21, 2020

Using pre-analysis plans to learn better and to learn together

Below is a Twitter thread in which I offer a perspective from my experience through EGAP (egap.org) on how to make effective use of pre-analysis plans and also research designs. The basic idea is that your research design and pre-analysis plan should serve as the basis of a discussion in which you can refine your design and analysis and gain buy-in from skeptics. A research design or pre-analysis plan that is never discussed publicly before it is implemented is a huge missed opportunity.

The thread was in response to a paper by Duflo et al. (linked in the thread) who focus mostly on pre-analysis plans as ways to bind yourself, without giving much consideration to the idea of using them as the basis of having an ex ante conversation about the research.

The thread is here:

As current Executive Director of @EGAPtweets let me explain how EGAP does research designs and pre-analysis plans (RDs/PAPs), which I think is different from what these authors understand as the point of RDs/PAPs. 1/9 https://t.co/f4DwqxVsWL
— Cyrus Samii (@cdsamii) April 21, 2020

RDs/PAPs should be a vehicle for having an ex ante *conversation* about what you are doing, as a way to refine your study and also get buy-in and agreement. 2/9
— Cyrus Samii (@cdsamii) April 21, 2020

This is similar to what Duflo et al discuss as the “interested party” rationale. I would say that this is more general than the set of cases that Duflo et al consider. 3/9
— Cyrus Samii (@cdsamii) April 21, 2020

Much of the work that is carried out via @EGAPtweets consists of expensive trials of policy innovations conducted with NGOs and governments. We don’t want to waste opportunities or mess these up. Feedback prior to implementation is *really* important. 4/9
— Cyrus Samii (@cdsamii) April 21, 2020

RDs/PAPs that are never reviewed or discussed prior to their implementation are kind of pointless in my opinion. 5/9
— Cyrus Samii (@cdsamii) April 21, 2020

The primary role of @EGAPtweets is to organize sessions to discuss RDs/PAPs. Also regional workshops (like NEWEPS, etc). do the same. The short PAPs that these authors advocate wouldn/t make for interesting sessions, and being able to do such sessions is the point. 6/9
— Cyrus Samii (@cdsamii) April 21, 2020

It seems that in econ this isn’t the practice, as seminar time is for finished and polished work. I honestly think many just don’t know what they are missing and would value @EGAPtweets style ex ante feedback. 7/9
— Cyrus Samii (@cdsamii) April 21, 2020

Research on COVID has made clear the need for these RD/PAP discussions, given problems with design and then also analysis of the trials and surveys. If such research were subject to pre-field and pre-analysis discussions among critical audiences, learning would be faster. 8/9
— Cyrus Samii (@cdsamii) April 21, 2020

As it is now, few people believe the results that are being produced because the designs and analyses are weak. Here the stakes are very real. 9/9
— Cyrus Samii (@cdsamii) April 21, 2020

January 29, 2020January 29, 2020

Open source environments for structural estimation

If you click on the tweet below, you will get a conversation on open source options (essentially Python, Julia, and R) for students interested in getting started with structural estimation:

For the structural estimation folks: for students starting from scratch, what would be the recommended open source alternative to Matlab to learn programing estimation routines (e.g., like Nevo 2000)? Julia?
— Cyrus Samii (@cdsamii) January 26, 2020

Among other things, people pointed to the following resources to get you started: