{"id":472048,"date":"2026-02-13T08:54:21","date_gmt":"2026-02-13T08:54:21","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/472048\/"},"modified":"2026-02-13T08:54:21","modified_gmt":"2026-02-13T08:54:21","slug":"can-ai-write-a-useful-philosophical-literature-review-guest-post","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/472048\/","title":{"rendered":"Can AI Write a Useful Philosophical Literature Review? (guest post)"},"content":{"rendered":"<p>A pair of philosophers have developed a new research tool that uses AI to provide comprehensive and reliable philosophical literature reviews, and they\u2019d like you to give it a try.<\/p>\n<p>Just last week I checked out <a href=\"https:\/\/openscilm.allen.ai\/\" target=\"_blank\" rel=\"noopener nofollow\">a new AI tool<\/a>\u00a0discussed in\u00a0<a href=\"https:\/\/www.nature.com\/articles\/s41586-025-10072-4\" target=\"_blank\" rel=\"noopener nofollow\">Nature<\/a>\u00a0that is supposed to be able to \u201csynthesize scientific literature\u201d. As good as it may be at that (I\u2019m not in a position to judge), I can tell you that it didn\u2019t seem to have access to much philosophy, and so was not of any use for philosophical inquiries. 
And general LLMs like ChatGPT may pull from random or odd or even imaginary sources, making them difficult to trust.<\/p>\n<p>Still, for some, the idea of an AI philosophy research assistant has significant appeal, and now, thanks to\u00a0<a href=\"https:\/\/johanneshimmelreich.net\/\" target=\"_blank\" rel=\"noopener nofollow\">Johannes Himmelreich<\/a> (Syracuse) and <a href=\"https:\/\/marcomeyer.net\/\" target=\"_blank\" rel=\"noopener nofollow\">Marco Meyer<\/a> (Hamburg), you can see for yourself what one could do and what you think about it.<\/p>\n<p>They call their tool\u00a0PhilLit, and in the following guest post, they explain why they made it, what it does, and how you can try it.<\/p>\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"wp-image-56559 aligncenter\" src=\"https:\/\/www.newsbeep.com\/ca\/wp-content\/uploads\/2026\/02\/phillit-1024x673.png\" alt=\"\" width=\"761\" height=\"500\"  \/><\/p>\n<p>Can AI Write a Useful Philosophical Literature Review?<br \/>by Johannes Himmelreich and Marco Meyer<\/p>\n<p>A year ago, the best AI model could complete tasks that take a human expert 56 minutes. Today, this same metric, the <a href=\"https:\/\/metr.org\/time-horizons\/\" rel=\"nofollow noopener\" target=\"_blank\">task-completion time horizon<\/a>, is around 6.5 hours.<a href=\"#_ftn1\" name=\"_ftnref1\">[1]<\/a> These numbers were derived from tasks used in software development. How much better did AI get in the past 12 months at tasks that we use in philosophy?<\/p>\n<p>Unfortunately, nobody knows. As philosophers, we might want to know whether and how AI can be used for philosophy. 
Of course, asking \u201chow AI can be used for philosophy\u201d in the abstract is about as fruitful as asking \u201chow the internet can be used for philosophy\u201d: it depends on the philosophical task and the corner of the internet where you look for help.<\/p>\n<p>Recently, this blog hosted <a href=\"https:\/\/dailynous.com\/2026\/01\/19\/have-pen-laptop-and-chatgpt-will-publish-guest-post\/\" rel=\"nofollow noopener\" target=\"_blank\">a guide<\/a> on whether AI can help develop research ideas through conversations. Conversations are a general-purpose tool for cognitive work. But research also involves more specific tasks.<\/p>\n<p>AI can help with at least one specific task that we as researchers undertake regularly: orienting ourselves in unfamiliar literature. But simply asking ChatGPT won\u2019t do. Even the research agents of the leading AI labs can\u2019t reliably get the facts right or limit themselves to academic literature, let alone consider how different debates relate to one another. Generic AI research agents fabricate citations and can\u2019t distinguish high-quality philosophical research from other content.<\/p>\n<p>We\u2019ve built a tool that does better. It\u2019s called <a href=\"https:\/\/github.com\/AI-4-Phi\/PhilLit\" rel=\"nofollow noopener\" target=\"_blank\">PhilLit<\/a>. It\u2019s open source, runs on Claude, and is free to use with a Claude Code subscription. 
In this post, we explain what the tool does and why it addresses a real need.<\/p>\n<p>You can take a look at example literature reviews about the <a href=\"https:\/\/raw.githubusercontent.com\/AI-4-Phi\/PhilLit\/main\/reviews\/extended-mind-cognitive-offloading\/cognitive-offloading-review.pdf\" rel=\"nofollow noopener\" target=\"_blank\">Extended Mind and Cognitive Offloading<\/a>, the <a href=\"https:\/\/raw.githubusercontent.com\/AI-4-Phi\/PhilLit\/main\/reviews\/metaphilosophy-literature-reviews\/lit-review-review.pdf\" rel=\"nofollow noopener\" target=\"_blank\">Metaphilosophy of Literature Reviews<\/a>, and the <a href=\"https:\/\/raw.githubusercontent.com\/AI-4-Phi\/PhilLit\/main\/reviews\/moral-value-diy\/diy-ethics-review.pdf\" rel=\"nofollow noopener\" target=\"_blank\">Moral Value of DIY<\/a>. If you are comfortable with the command line or the Terminal app on Mac, you can generate reviews on whatever topic you like. In principle, the tool could run on a website with a friendly and easy-to-use interface. But for now, we concentrate on how well it works before improving how easy it is to use. To assess whether this tool lives up to the standards required for serious philosophical research, we are preparing a research study.<\/p>\n<p>What PhilLit is for<\/p>\n<p>PhilLit is for philosophers who want an up-to-date overview of the philosophical literature on a topic. Maybe you\u2019re an ethicist who needs to understand debates in the epistemology of testimony. Maybe you work on philosophy of mind and want to engage with recent work on AI agency. Maybe you\u2019re writing a grant proposal that crosses subfield boundaries.<\/p>\n<p>What do you do? You ask colleagues. But they may not work on the specific intersection you need. You look for an SEP article. 
It is excellent when one exists, but the Stanford Encyclopedia doesn\u2019t cover every topic, entries can lag years behind the latest work, and they\u2019re written for a general audience rather than oriented toward your specific question. You browse PhilPapers. It gives you papers but no map of the debate. In desperation, you ask ChatGPT. It is fast, but you can\u2019t trust the citations, and some of the sources it cites are obscure posts on Reddit. None of these give you what you actually need: a reliable, up-to-date overview of the philosophical literature on a topic, organized around the key debates and positions, with a verified bibliography you can start reading from.<\/p>\n<p>What PhilLit does<\/p>\n<p>PhilLit tries to solve this problem. You give it a research topic or question, and it produces two things: an analytical overview of the literature (roughly 3,000\u20134,000 words) organized around key debates and positions, plus a verified and annotated bibliography in BibTeX format that you can import directly into your reference manager of choice.<\/p>\n<p>Think of the output as a personalized, up-to-date SEP-like article, tailored to your specific research question. Unlike a static encyclopedia entry, PhilLit can regenerate a current overview anytime\u2014a step toward what a continuously updated SEP might look like.<\/p>\n<p>To be clear about what PhilLit is not: it\u2019s not meant to write the literature review section of your paper, or to produce text for journal submissions or grant applications. It\u2019s a research tool. The aim is not to produce more philosophical text, but to make feasible the kind of thorough engagement with adjacent literatures that good research requires and that time constraints often prevent. 
The output is a starting point for doing the philosophical work yourself: reading the papers, forming your own views, and identifying where your contribution fits.<\/p>\n<p>As a slogan, the idea of using AI to augment research is to <a href=\"https:\/\/freesystems.substack.com\/p\/the-100x-research-institution?triedRedirect=true\" rel=\"nofollow noopener\" target=\"_blank\">put in 100x the research effort, not publish 100x more<\/a>.<\/p>\n<p>Is PhilLit better than ChatGPT?<\/p>\n<p>You might wonder why we built a dedicated tool when you could just prompt the Research feature of Claude or ChatGPT with \u201cwrite me a literature review on X.\u201d PhilLit is built on Anthropic\u2019s Claude, but it is designed to meet the requirements of philosophical research. Three design features matter most:<\/p>\n<p style=\"padding-left: 40px;\">PhilLit searches relevant databases. Every paper in the output was found by searching actual academic databases: PhilLit searches the Stanford Encyclopedia of Philosophy, PhilPapers, Notre Dame Philosophical Reviews, Semantic Scholar, OpenAlex, arXiv, and CrossRef. The system queries the same sources you\u2019d search yourself, and nothing else.<\/p>\n<p style=\"padding-left: 40px;\">PhilLit verifies every citation. The system includes a verification process. Every bibliographic detail, e.g.\u00a0title, author, journal name, volume number, page range, year in the case of a journal article, is checked against API data in bibliographic databases. If a detail can\u2019t be verified against an authoritative source, it\u2019s removed. This means the bibliography may occasionally have gaps (a missing volume number), but it won\u2019t contain fabrications.<\/p>\n<p style=\"padding-left: 40px;\">PhilLit is built for philosophy. 
Most AI research tools, including <a href=\"https:\/\/openscilm.allen.ai\/\" rel=\"nofollow noopener\" target=\"_blank\">recently released open-source products<\/a>, are designed with disciplines like biomedicine or computer science in mind. PhilLit, by contrast, organizes reviews by identifying arguments and positions in philosophical debates. And because its search process is systematic rather than relying on any individual\u2019s scholarly network, it can be directed to seek out work on neglected topics or from underrepresented traditions. Such a system could go some way toward correcting biases that inevitably arise from the way in which we otherwise discover and disseminate knowledge.<\/p>\n<p>Is PhilLit any good in practice?<\/p>\n<p>At minimum, a literature review should be accurate (about metadata and interpretation), comprehensive, analytically perceptive, and written in a helpful way. To what extent reviews generated by PhilLit possess these qualities is largely an empirical question. The agent architecture that we developed addresses some serious failures of other literature review agents. But how far that gets us, we don\u2019t know.<\/p>\n<p>Anyone can use PhilLit now. We\u2019re excited to hear about what it does and doesn\u2019t do well.<\/p>\n<p>Moreover, to assess PhilLit rigorously, we\u2019re launching two validation studies (pending IRB approval). We\u2019re looking for philosophers willing to test PhilLit on topics they already know well.<\/p>\n<p>How to use it<\/p>\n<p>PhilLit is open source and free to download. The only cost of using it is paying to access the Claude family of models developed by Anthropic. If you already have a subscription to Claude Code, you can use PhilLit at no additional cost. If you use pay-as-you-go API credits, a review should cost you 9 to 13 USD on average, depending on whether you choose the cheaper model (Sonnet 4.5) or the more expensive one (Opus 4.6 on high effort). 
You will be paying Anthropic, but not us.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-56560\" class=\" wp-image-56560\" src=\"https:\/\/www.newsbeep.com\/ca\/wp-content\/uploads\/2026\/02\/phillit_screenshot.png\" alt=\"\" width=\"900\" height=\"548\"  \/><\/p>\n<p id=\"caption-attachment-56560\" class=\"wp-caption-text\">Using PhilLit in Claude Code<\/p>\n<p>You can try PhilLit in two ways.<\/p>\n<p> Run it yourself: If you are comfortable with Python and the command line, you can install the tool directly from GitHub. You will need:<\/p>\n<p>Python installed on your machine.<br \/>\nAPI keys from Anthropic, Semantic Scholar, and Brave Search.<br \/>\nFamiliarity with running scripts in a terminal.<\/p>\n<p>The repository includes detailed setup instructions: <a href=\"https:\/\/github.com\/AI-4-Phi\/PhilLit\/blob\/main\/GETTING_STARTED.md\" rel=\"nofollow noopener\" target=\"_blank\">PhilLit \u2013 Getting Started<\/a>.<\/p>\n<p> Participate in our Validation Study: We are designing two studies to test whether the literature overviews are genuinely useful to experts. We need philosophers to test PhilLit on topics they already know well.<\/p>\n<p>For our first study, you will use PhilLit yourself and assess the reviews you get. We help you with the technical setup and pay the costs of running the reviews on the topics of your choice. You will provide structured feedback on accuracy, comprehensiveness, and usefulness.<\/p>\n<p>Our second study is for anyone, regardless of whether you are comfortable with the Terminal app, Python, or managing API keys. 
You will get a chance to provide feedback on the reviews that others generated.<\/p>\n<p>If you are interested in participating in either of these studies: sign up <a href=\"https:\/\/forms.gle\/tUFRJusdRPmn7vwz7\" rel=\"nofollow noopener\" target=\"_blank\">here<\/a> and we\u2019ll update you once we\u2019re ready to go.<\/p>\n<p><a href=\"#_ftnref1\" name=\"_ftn1\">[1]<\/a> This is the 50% task-completion time-horizon, that is, the maximum task duration (measured by how long it takes a human expert) at which an AI agent is predicted to succeed at least half of the time.<\/p>\n<p>Related:<\/p>\n<p>\u201c<a href=\"https:\/\/dailynous.com\/2022\/10\/24\/two-cultures-of-philosophy-ai-edition\/\" target=\"_blank\" rel=\"noopener nofollow\">Two Cultures of Philosophy: AI Edition<\/a>\u201d<br \/>\u201c<a href=\"https:\/\/dailynous.com\/2021\/07\/06\/shaping-the-ai-revolution-in-philosophy-guest-post\/\" rel=\"nofollow noopener\" target=\"_blank\">Shaping the AI Revolution in Philosophy<\/a>\u201d<br \/>\u201c<a href=\"https:\/\/dailynous.com\/2021\/03\/24\/hey-sophi-or-how-much-philosophy-will-computers-do\/\" rel=\"nofollow noopener\" target=\"_blank\">\u2018Hey Sophi\u2019, or How Much Philosophy Will Computers Do?<\/a>\u201d<br \/>\u201c<a href=\"https:\/\/dailynous.com\/2024\/03\/14\/reviving-the-philosophical-dialogue-with-large-language-models-guest-post\/\" target=\"_blank\" rel=\"noopener nofollow\">Reviving the Philosophical Dialogue with Large Language Models<\/a>\u201d<br \/>\u201c<a href=\"https:\/\/dailynous.com\/2025\/03\/13\/philosophers-develop-ai-based-teaching-tool-to-promote-constructive-disagreement-guest-post\/\" target=\"_blank\" rel=\"noopener nofollow\">Philosophers Develop AI-Based Teaching Tool to Promote Constructive Disagreement<\/a>\u201d<br \/>\u201c<a href=\"https:\/\/dailynous.com\/2026\/01\/19\/have-pen-laptop-and-chatgpt-will-publish-guest-post\/\" rel=\"nofollow noopener\" target=\"_blank\">Have Pen, Laptop, and ChatGPT, Will 
Publish<\/a>\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"A pair of philosophers have developed a new research tool that uses AI to provide comprehensive and reliable&hellip;\n","protected":false},"author":2,"featured_media":472049,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[62,276,277,49,48,61],"class_list":{"0":"post-472048","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-ca","12":"tag-canada","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/472048","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=472048"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/472048\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/472049"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=472048"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=472048"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=472048"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}