{"id":217715,"date":"2025-01-23T10:27:08","date_gmt":"2025-01-23T15:27:08","guid":{"rendered":"https:\/\/ibkrcampus.com\/campus\/?p=217715"},"modified":"2025-01-23T10:28:13","modified_gmt":"2025-01-23T15:28:13","slug":"artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact","status":"publish","type":"post","link":"https:\/\/www.interactivebrokers.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/","title":{"rendered":"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)"},"content":{"rendered":"\n<p><em>The post &#8220;Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)&#8221; first appeared on <a href=\"https:\/\/alphaarchitect.com\/2025\/01\/artificial-intelligence-2\/\">Alpha Architect<\/a> blog.<\/em><\/p>\n\n\n\n<p>Academics have long been aware of the risks of data mining\u2014torturing the data until it confesses. The concern is that correlation of variables doesn\u2019t imply that the correlation is a result of causation. That is the reason that the prevailing academic standard for researchers is that they should first develop their hypothesis and predictions before testing them against the data. To minimize the risks of a study being the result of a data mining exercise, in our book \u201c<a href=\"https:\/\/url.avanan.click\/v2\/___https:\/www.amazon.com\/s?k=your+complete+guide+to+factor-based+investing&amp;crid=20OUOHXSYAPP6&amp;sprefix=your+comple%2Caps%2C271&amp;ref=nb_sb_ss_ts-doa-p_4_11___.YXAzOnNhcmFncmlsbG86YTpnOjBlZjZhOTc1ZDk0YTdiMDQ4NTE4Y2MzNmU3YmNmNGE4OjY6Njc5YTo2ODNlMzkxYTQyMTIxYTQyYTRlYjFjNDg3MmEzNTZkNzI1MTExNDFlODU1Y2Q1NTQwNmNhNGMwOWM2NDE4NDRiOnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\">Your Complete Guide to Factor-Based Investing<\/a>,\u201d Andrew Berkin and I recommend that before one should consider investing in a factor-based strategy&nbsp;<em>all<\/em>&nbsp;of the following tests be applied. To start, it must provide explanatory power to portfolio returns and have delivered a premium (higher returns). Additionally, the factor must be:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Persistent \u2014 It holds across long periods of time and different economic regimes.<\/li>\n\n\n\n<li>Pervasive \u2014 It holds across countries, regions, sectors, and even asset classes.<\/li>\n\n\n\n<li>Robust \u2014 It holds for various definitions (for example, there is a value premium whether it is measured by price-to-book, earnings, cash flow, or sales).<\/li>\n\n\n\n<li>Investable \u2014 It holds up not just on paper, but also after considering actual implementation issues, such as trading costs.<\/li>\n\n\n\n<li>Intuitive \u2014 There are logical risk-based or behavioral-based explanations for its premium and why it should continue to exist.<\/li>\n<\/ul>\n\n\n\n<p>The important role of these criteria has increased due to enhanced power of the tools of artificial intelligence and&nbsp;<a href=\"https:\/\/url.avanan.click\/v2\/___https:\/en.wikipedia.org\/wiki\/Large_language_model___.YXAzOnNhcmFncmlsbG86YTpnOjBlZjZhOTc1ZDk0YTdiMDQ4NTE4Y2MzNmU3YmNmNGE4OjY6YTkwYTo1MjRhY2IwZTVjMDVmZmQ1ZTgwOWMxMTE3MTVmMjdhMDA0ZTQ1MDRmYjk0YmRiMzYxMjgwNGZjNDU1YTk4ZjdkOnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\">large language models<\/a>&nbsp;(LLMs).<\/p>\n\n\n\n<p><strong>The Role of AI in Financial Research<\/strong><\/p>\n\n\n\n<p>Artificial Intelligence (AI) offers the intriguing potential to revolutionize investment decision-making by providing important advantages such as: &nbsp;<\/p>\n\n\n\n<p><strong>Enhanced Data Analysis<\/strong>: AI can process and analyze vast amounts of data from various sources, including financial news, market trends, and company fundamentals, at a speed and scale far surpassing human capabilities. This enables investors to identify patterns, correlations, and anomalies that may be difficult for humans to detect. &nbsp;<\/p>\n\n\n\n<p><strong>Improved Prediction Accuracy<\/strong>: AI algorithms can leverage historical data and machine learning techniques to build predictive models that forecast future market movements, asset prices, and investment returns with greater accuracy than traditional methods (avoidance of cognitive biases to which humans are susceptible\u2014AI is more rational).<\/p>\n\n\n\n<p>However, using AI to build predictive models increases the risks of data mining outcomes. In their December 2024 paper, \u201c<a href=\"https:\/\/url.avanan.click\/v2\/___https:\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=5060022___.YXAzOnNhcmFncmlsbG86YTpnOjBlZjZhOTc1ZDk0YTdiMDQ4NTE4Y2MzNmU3YmNmNGE4OjY6ZGZhYzo4NjI3MWMxMWQ1NWIwYzJmOTg0MDU5YTVkYTI3MzJlMGJhNDY0N2JlZDZhNzUzN2Y5MjA4MTRmNzljOTFhOGZmOnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\">AI-Powered (Finance) Scholarship<\/a>,\u201d authors Robert Novy-Marx and Mihail Velikov began by noting that the prior research (on&nbsp;<a href=\"https:\/\/arxiv.org\/html\/2409.04109v1\" target=\"_blank\" rel=\"noreferrer noopener\">LLMs and novel research<\/a>, &nbsp;<a href=\"https:\/\/www.ethicalpsychology.com\/2024\/06\/can-generative-ai-improve-social-science.html\" target=\"_blank\" rel=\"noreferrer noopener\">Generative AI,<\/a>&nbsp;language models, , the&nbsp;<a href=\"https:\/\/arxiv.org\/html\/2404.01268v1\" target=\"_blank\" rel=\"noreferrer noopener\">increasing use of LLMs<\/a>&nbsp;in scientific papers, and&nbsp;<a href=\"https:\/\/arxiv.org\/pdf\/2408.06292\" target=\"_blank\" rel=\"noreferrer noopener\">scientific discovery<\/a>&nbsp;has shown that AI systems can not only meaningfully engage with economic reasoning and prediction, but that it is capable of testing scientific hypotheses in silico\u2014using computer programs and algorithms to model a system, simulate experiments and analyze data.<\/p>\n\n\n\n<p><strong>Benefits of in silico testing:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Faster and cheaper:<\/strong>&nbsp;Compared to traditional lab experiments, in silico methods can be much faster and less expensive.<\/li>\n\n\n\n<li><strong>More efficient:<\/strong>&nbsp;Allows researchers to explore a wider range of possibilities and test more hypotheses in a shorter amount of time.<\/li>\n<\/ul>\n\n\n\n<p>Novy-Marx and Velikov described a process for automatically generating academic finance papers using LLMs. They began by mining over 30,000 potential stock return predictor signals from accounting data, and applied the Novy-Marx and Velikov (2024) \u201c<a href=\"https:\/\/url.avanan.click\/v2\/___https:\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=4338007___.YXAzOnNhcmFncmlsbG86YTpnOjBlZjZhOTc1ZDk0YTdiMDQ4NTE4Y2MzNmU3YmNmNGE4OjY6Yjc2ZDozNzRhOTU5OWZjNjdhNjYwMzI3YmZmMzQ2ODAxMjAwYmRjODUzMDNjYmMzZTUwMTIyMWFlNjY1YzJkOWI0NjRjOnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\">Assaying Anomalies<\/a>\u201d protocol to generate standardized \u201ctemplate reports\u201d for the 96 signals that passed the protocol\u2019s rigorous criteria (identifying issues that commonly arise testing equity strategies, paying particular attention to arbitrage limits that can make a strategy look good on paper even when it cannot be profitably traded in practice). &nbsp;Each report detailed a signal\u2019s performance&nbsp;<a href=\"https:\/\/alphaarchitect.com\/2018\/01\/predict-stock-returns-using-various-firm-characteristics\/\" target=\"_blank\" rel=\"noreferrer noopener\">predicting stock returns<\/a>&nbsp;using a wide array of tests and benchmarked it to more than 200 other known anomalies. They then used state-of-the-art LLMs to generate three distinct complete versions of academic papers for each signal. The different versions included creative names for the signals, contained custom introductions providing different theoretical justifications for the observed predictability patterns, and incorporated citations to existing (and, on occasion, imagined) literature supporting their respective claims. &nbsp;As my friend, and co-author, Andrew Berkin pointed out: This is emblematic of some of the problems that currently exist with AI. It will give you an answer, but not necessarily a correct one. For that reason, some call AI a \u201clying machine.\u201d<\/p>\n\n\n\n<p>The \u201c288 fully programmatically-generated papers contain introductions that follow standard academic conventions, developing theoretical arguments that connect the documented return patterns to established economic mechanisms, incorporating citations to existing (and, at least for now, on occasion hallucinated) literature. Each paper includes comprehensive descriptions of the data and methodology, detailed discussion of results, and contextualized conclusions.\u201d<\/p>\n\n\n\n<p>They then used a more advanced LLM (<a href=\"https:\/\/url.avanan.click\/v2\/___https:\/www.anthropic.com\/news\/claude-3-5-sonnet___.YXAzOnNhcmFncmlsbG86YTpnOjBlZjZhOTc1ZDk0YTdiMDQ4NTE4Y2MzNmU3YmNmNGE4OjY6ODMzNDpmN2ZkODlmZWQxNmZjNjM1YzViYzAzMTdhYTU4NzI2NjEwMzAzMGJlZWU5YzBmZTc5M2YwMTdlNGEwOTY3ZTYxOnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\">Claude 3.5-Sonnet<\/a>) to generate the core textual content of each paper. For example, the introduction, composed of roughly 1,100 words, was subdivided into four sections to ensure a balanced, academically coherent narrative:<\/p>\n\n\n\n<p>1. Motivation (200 words): Frames the research question within the broader asset pricing literature, discussing market efficiency, cross-sectional predictability, and recent developments in factor research.<\/p>\n\n\n\n<p>2. Hypothesis Development (300 words): Proposes economic mechanisms justifying the signal\u2019s predictive power, citing relevant theoretical and empirical studies to maintain a scholarly tone and contextualize the new factor.<\/p>\n\n\n\n<p>3. Results Summary (300 words): Presents key empirical findings, highlighting statistical significance, robustness checks, and comparisons to established anomalies.<\/p>\n\n\n\n<p>4. Contribution (300 words): Places the proposed signal in relation to 3\u20134 closely related studies, articulating how the new evidence enhances our understanding of systematic return drivers and contributes to ongoing debates in the literature.<\/p>\n\n\n\n<p>All generated text adhered to a formal academic writing style, utilized active voice, and carefully distinguished correlation from causation, avoided unwarranted claims, and ensured appropriate application of tense to reflect established knowledge versus new findings. In addition, citations were embedded using LaTeX-formatted references, and all writing conventions aligned with norms in leading finance journals.<\/p>\n\n\n\n<p>The other added sections of each manuscript, including Data and Conclusion, were generated following similarly structured prompts. They added:<\/p>\n\n\n\n<p>\u201cWhile the papers and their theoretical frameworks are automatically generated, it\u2019s important to note that all empirical analyses and statistical validations are conducted using rigorous methods developed in the academic literature, ensuring the reliability (if not the interpretation) of the underlying findings.\u201d<\/p>\n\n\n\n<p>Novy-Marx and Velikov noted:<\/p>\n\n\n\n<p>\u201cThe process is remarkably efficient \u2013 while the data mining, validation, and generation of the PDF \u201ctemplate reports\u201d from the \u201cAssaying Anomalies\u201d protocol takes about a day of computation time, the final paper generation takes minutes. This represents a dramatic acceleration compared to traditional research paper development.\u201d<\/p>\n\n\n\n<p>Their findings led Novy-Marx and Velikov to conclude:<\/p>\n\n\n\n<p>\u201cThis experiment illustrates AI\u2019s potential for enhancing financial research efficiency, but also serves as a cautionary tale, illustrating how it can be abused to industrialize HARKing (Hypothesizing After Results are Known).\u201d<\/p>\n\n\n\n<p>They added this caution:<\/p>\n\n\n\n<p>\u201cThe ease with which AI can generate convincing theoretical frameworks that reference prior literature may inadvertently create a new form of academic arbitrage \u2013 where researchers can boost their citation counts through automated paper generation. It is actually easy to imagine a scenario in which entire fictitious sub-fields of a literature emerge in which all of the citations are from AI-generated papers to other reciprocally citing AI-generated papers.\u201d<\/p>\n\n\n\n<p><strong>Investor Takeaways<\/strong><\/p>\n\n\n\n<p>Novy-Marx and Velikov provided a concrete demonstration of how LLMs can be used to automate the generation of academic finance papers at scale. \u201cOur results show that AI can now develop hypotheses at an unprecedented scale.\u201d They then demonstrated: The emergence of sophisticated AI systems capable of generating (multiple) plausible theoretical frameworks at scale poses novel challenges to traditional mechanisms used to judge the reliability of research findings. The takeaway then is that because AI systems can produce hundreds of seemingly coherent theoretical explanations for mined empirical results, investors need to establish high hurdles before allocating to anomaly-based strategies.<\/p>\n\n\n\n<p><em>Larry Swedroe is the author or co-author of 18 books on investing, including his latest&nbsp;<\/em><a href=\"https:\/\/url.avanan.click\/v2\/___https:\/www.amazon.com\/Enrich-Your-Future-Successful-Investing\/dp\/1394245440\/ref=sr_1_1?crid=3CRTRKLGGTIUZ&amp;dib=eyJ2IjoiMSJ9.R1VwlsG9zUQwWdwscc3NP2weOd68TWy06Pkb7dbZIGAvQh495OyVuwNZUlbqWCFfThmw4F0kLezZuasIfAqDfuL_8FIZ8W7G52DNVhST1gw.Ku79ZOt9_Acsmcid89f59xh--PWH68jBasVttUiv7us&amp;dib_tag=se&amp;keywords=enrich+your+future+larry+swedroe&amp;qid=1734641782&amp;sprefix=enrich+your+futgure%2Caps%2C200&amp;sr=8-1___.YXAzOnNhcmFncmlsbG86YTpnOmJlMTM3ZThiZWNhNWI3NGNkODRiNzJiZGZmYzM0OTVhOjY6ODBlMTo0N2U5MDA1MmNhNWRiYmU1ODhlYTg1YzI1YWEzNGM3OGJmNGE1OWIyZGY3MTE1YTM5ZjA1NjkzN2Y4OWI5M2Y1OnA6VDpO\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Enrich Your Future.<\/em><\/a><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Academics have long been aware of the risks of data mining-torturing the data until it confesses. <\/p>\n","protected":false},"author":298,"featured_media":202338,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[339,338,341],"tags":[912,8485,806,18293],"contributors-categories":[13651],"class_list":{"0":"post-217715","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-data-science","8":"category-ibkr-quant-news","9":"category-quant-development","10":"tag-artificial-intelligence","11":"tag-data-mining","12":"tag-data-science","13":"tag-harking-hypothesizing-after-the-fact","14":"contributors-categories-alpha-architect"},"pp_statuses_selecting_workflow":false,"pp_workflow_action":"current","pp_status_selection":"publish","acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.9 (Yoast SEO v27.7) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)<\/title>\n<meta name=\"description\" content=\"Academics have long been aware of the risks of data mining-torturing the data until it confesses.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.interactivebrokers.com\/campus\/wp-json\/wp\/v2\/posts\/217715\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)\" \/>\n<meta property=\"og:description\" content=\"Academics have long been aware of the risks of data mining-torturing the data until it confesses.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.interactivebrokers.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/\" \/>\n<meta property=\"og:site_name\" content=\"IBKR Campus US\" \/>\n<meta property=\"article:published_time\" content=\"2025-01-23T15:27:08+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-01-23T15:28:13+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"563\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Larry Swedroe\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Larry Swedroe\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\n\t    \"@context\": \"https:\\\/\\\/schema.org\",\n\t    \"@graph\": [\n\t        {\n\t            \"@type\": \"NewsArticle\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#article\",\n\t            \"isPartOf\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/\"\n\t            },\n\t            \"author\": {\n\t                \"name\": \"Larry Swedroe\",\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#\\\/schema\\\/person\\\/75544487e0633f170eb31cb59c37e64f\"\n\t            },\n\t            \"headline\": \"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)\",\n\t            \"datePublished\": \"2025-01-23T15:27:08+00:00\",\n\t            \"dateModified\": \"2025-01-23T15:28:13+00:00\",\n\t            \"mainEntityOfPage\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/\"\n\t            },\n\t            \"wordCount\": 1330,\n\t            \"commentCount\": 0,\n\t            \"publisher\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#organization\"\n\t            },\n\t            \"image\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#primaryimage\"\n\t            },\n\t            \"thumbnailUrl\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/blue-screen-charts-data-science.jpg\",\n\t            \"keywords\": [\n\t                \"Artificial Intelligence\",\n\t                \"Data Mining\",\n\t                \"Data Science\",\n\t                \"Harking (Hypothesizing After-the-Fact)\"\n\t            ],\n\t            \"articleSection\": [\n\t                \"Data Science\",\n\t                \"Quant\",\n\t                \"Quant Development\"\n\t            ],\n\t            \"inLanguage\": \"en-US\",\n\t            \"potentialAction\": [\n\t                {\n\t                    \"@type\": \"CommentAction\",\n\t                    \"name\": \"Comment\",\n\t                    \"target\": [\n\t                        \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#respond\"\n\t                    ]\n\t                }\n\t            ]\n\t        },\n\t        {\n\t            \"@type\": \"WebPage\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/\",\n\t            \"url\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/\",\n\t            \"name\": \"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact) | IBKR Campus US\",\n\t            \"isPartOf\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#website\"\n\t            },\n\t            \"primaryImageOfPage\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#primaryimage\"\n\t            },\n\t            \"image\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#primaryimage\"\n\t            },\n\t            \"thumbnailUrl\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/blue-screen-charts-data-science.jpg\",\n\t            \"datePublished\": \"2025-01-23T15:27:08+00:00\",\n\t            \"dateModified\": \"2025-01-23T15:28:13+00:00\",\n\t            \"description\": \"Academics have long been aware of the risks of data mining-torturing the data until it confesses.\",\n\t            \"inLanguage\": \"en-US\",\n\t            \"potentialAction\": [\n\t                {\n\t                    \"@type\": \"ReadAction\",\n\t                    \"target\": [\n\t                        \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/\"\n\t                    ]\n\t                }\n\t            ]\n\t        },\n\t        {\n\t            \"@type\": \"ImageObject\",\n\t            \"inLanguage\": \"en-US\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/ibkr-quant-news\\\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\\\/#primaryimage\",\n\t            \"url\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/blue-screen-charts-data-science.jpg\",\n\t            \"contentUrl\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/02\\\/blue-screen-charts-data-science.jpg\",\n\t            \"width\": 1000,\n\t            \"height\": 563,\n\t            \"caption\": \"Data Science\"\n\t        },\n\t        {\n\t            \"@type\": \"WebSite\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#website\",\n\t            \"url\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/\",\n\t            \"name\": \"IBKR Campus US\",\n\t            \"description\": \"Financial Education from Interactive Brokers\",\n\t            \"publisher\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#organization\"\n\t            },\n\t            \"potentialAction\": [\n\t                {\n\t                    \"@type\": \"SearchAction\",\n\t                    \"target\": {\n\t                        \"@type\": \"EntryPoint\",\n\t                        \"urlTemplate\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/?s={search_term_string}\"\n\t                    },\n\t                    \"query-input\": {\n\t                        \"@type\": \"PropertyValueSpecification\",\n\t                        \"valueRequired\": true,\n\t                        \"valueName\": \"search_term_string\"\n\t                    }\n\t                }\n\t            ],\n\t            \"inLanguage\": \"en-US\"\n\t        },\n\t        {\n\t            \"@type\": \"Organization\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#organization\",\n\t            \"name\": \"Interactive Brokers\",\n\t            \"alternateName\": \"IBKR\",\n\t            \"url\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/\",\n\t            \"logo\": {\n\t                \"@type\": \"ImageObject\",\n\t                \"inLanguage\": \"en-US\",\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#\\\/schema\\\/logo\\\/image\\\/\",\n\t                \"url\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/05\\\/ibkr-campus-logo.jpg\",\n\t                \"contentUrl\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/wp-content\\\/uploads\\\/sites\\\/2\\\/2024\\\/05\\\/ibkr-campus-logo.jpg\",\n\t                \"width\": 669,\n\t                \"height\": 669,\n\t                \"caption\": \"Interactive Brokers\"\n\t            },\n\t            \"image\": {\n\t                \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#\\\/schema\\\/logo\\\/image\\\/\"\n\t            },\n\t            \"publishingPrinciples\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/about-ibkr-campus\\\/\",\n\t            \"ethicsPolicy\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/cyber-security-notice\\\/\"\n\t        },\n\t        {\n\t            \"@type\": \"Person\",\n\t            \"@id\": \"https:\\\/\\\/ibkrcampus.com\\\/campus\\\/#\\\/schema\\\/person\\\/75544487e0633f170eb31cb59c37e64f\",\n\t            \"name\": \"Larry Swedroe\",\n\t            \"description\": \"As Chief Research Officer for Buckingham Strategic Wealth and Buckingham Strategic Partners, Larry Swedroe spends his time, talent and energy educating investors on the benefits of evidence-based investing with enthusiasm few can match. https:\\\/\\\/twitter.com\\\/larryswedroe\",\n\t            \"url\": \"https:\\\/\\\/www.interactivebrokers.com\\\/campus\\\/author\\\/larryswedroe\\\/\"\n\t        }\n\t    ]\n\t}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)","description":"Academics have long been aware of the risks of data mining-torturing the data until it confesses.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.interactivebrokers.com\/campus\/wp-json\/wp\/v2\/posts\/217715\/","og_locale":"en_US","og_type":"article","og_title":"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)","og_description":"Academics have long been aware of the risks of data mining-torturing the data until it confesses.","og_url":"https:\/\/www.interactivebrokers.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/","og_site_name":"IBKR Campus US","article_published_time":"2025-01-23T15:27:08+00:00","article_modified_time":"2025-01-23T15:28:13+00:00","og_image":[{"width":1000,"height":563,"url":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","type":"image\/jpeg"}],"author":"Larry Swedroe","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Larry Swedroe","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#article","isPartOf":{"@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/"},"author":{"name":"Larry Swedroe","@id":"https:\/\/ibkrcampus.com\/campus\/#\/schema\/person\/75544487e0633f170eb31cb59c37e64f"},"headline":"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact)","datePublished":"2025-01-23T15:27:08+00:00","dateModified":"2025-01-23T15:28:13+00:00","mainEntityOfPage":{"@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/"},"wordCount":1330,"commentCount":0,"publisher":{"@id":"https:\/\/ibkrcampus.com\/campus\/#organization"},"image":{"@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#primaryimage"},"thumbnailUrl":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","keywords":["Artificial Intelligence","Data Mining","Data Science","Harking (Hypothesizing After-the-Fact)"],"articleSection":["Data Science","Quant","Quant Development"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/","url":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/","name":"Artificial Intelligence and the Risks of Harking (Hypothesizing After-the-Fact) | IBKR Campus US","isPartOf":{"@id":"https:\/\/ibkrcampus.com\/campus\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#primaryimage"},"image":{"@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#primaryimage"},"thumbnailUrl":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","datePublished":"2025-01-23T15:27:08+00:00","dateModified":"2025-01-23T15:28:13+00:00","description":"Academics have long been aware of the risks of data mining-torturing the data until it confesses.","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ibkrcampus.com\/campus\/ibkr-quant-news\/artificial-intelligence-and-the-risks-of-harking-hypothesizing-after-the-fact\/#primaryimage","url":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","contentUrl":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","width":1000,"height":563,"caption":"Data Science"},{"@type":"WebSite","@id":"https:\/\/ibkrcampus.com\/campus\/#website","url":"https:\/\/ibkrcampus.com\/campus\/","name":"IBKR Campus US","description":"Financial Education from Interactive Brokers","publisher":{"@id":"https:\/\/ibkrcampus.com\/campus\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ibkrcampus.com\/campus\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/ibkrcampus.com\/campus\/#organization","name":"Interactive Brokers","alternateName":"IBKR","url":"https:\/\/ibkrcampus.com\/campus\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ibkrcampus.com\/campus\/#\/schema\/logo\/image\/","url":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/05\/ibkr-campus-logo.jpg","contentUrl":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/05\/ibkr-campus-logo.jpg","width":669,"height":669,"caption":"Interactive Brokers"},"image":{"@id":"https:\/\/ibkrcampus.com\/campus\/#\/schema\/logo\/image\/"},"publishingPrinciples":"https:\/\/www.interactivebrokers.com\/campus\/about-ibkr-campus\/","ethicsPolicy":"https:\/\/www.interactivebrokers.com\/campus\/cyber-security-notice\/"},{"@type":"Person","@id":"https:\/\/ibkrcampus.com\/campus\/#\/schema\/person\/75544487e0633f170eb31cb59c37e64f","name":"Larry Swedroe","description":"As Chief Research Officer for Buckingham Strategic Wealth and Buckingham Strategic Partners, Larry Swedroe spends his time, talent and energy educating investors on the benefits of evidence-based investing with enthusiasm few can match. https:\/\/twitter.com\/larryswedroe","url":"https:\/\/www.interactivebrokers.com\/campus\/author\/larryswedroe\/"}]}},"jetpack_featured_media_url":"https:\/\/www.interactivebrokers.com\/campus\/wp-content\/uploads\/sites\/2\/2024\/02\/blue-screen-charts-data-science.jpg","_links":{"self":[{"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/posts\/217715","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/users\/298"}],"replies":[{"embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/comments?post=217715"}],"version-history":[{"count":0,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/posts\/217715\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/media\/202338"}],"wp:attachment":[{"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/media?parent=217715"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/categories?post=217715"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/tags?post=217715"},{"taxonomy":"contributors-categories","embeddable":true,"href":"https:\/\/ibkrcampus.com\/campus\/wp-json\/wp\/v2\/contributors-categories?post=217715"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}