Designing and operating the full infrastructure of a systematic investment research program: data acquisition and engineering, analysis tooling, experiment design, and team prioritization. The research platform is not just software — it determines which hypotheses can be tested, how quickly, and with what rigor. Research process quality is a sustainable competitive moat because it compounds: good infrastructure enables better experiments which generates better edges.
Practitioner maintains a live research platform where data, analysis tools, and team skills are continuously upgraded. Prioritizes research projects by AUM-scaled information ratio (expected alpha per unit of research effort, scaled by how much capital the strategy can absorb). Separates data processing (commodity, can outsource) from analytics (proprietary, must own). Maintains a spectrum from strong-prior (academic empirical finance) to weak-prior (machine learning, allows data to reveal structure) research approaches to achieve framework diversity.
Strong-prior (academic empirical finance) and weak-prior (machine learning) research are not just different discovery tools — they serve complementary lifecycle roles. Strong-prior is better at initial hypothesis development with theoretical grounding. Weak-prior is better at detecting when a historically strong-prior factor is decaying in real-time because it has no attachment to the original thesis.
The standard practice of purchasing vendor analytics (processed factor scores from Barra, FactSet, etc.) creates a structural blind spot: when the factor decays, you cannot diagnose why because you don't own the mechanism. The edge is not in having data — it is in the causal chain from raw data to return. That chain must be owned in-house.
Research instinct focuses on finding one great data source — the satellite data, the alternative signal, the unique insight. But the empirical evidence from mature systematic shops is that the edge comes from building a richer information mosaic than competitors, not from superior processing of any single source. Combining five independent partial signals that are each 55% predictive produces a more reliable combined signal than one 65% predictive source, because the combination reduces the variance around the prediction.