{"id":55103,"date":"2025-09-17T11:37:35","date_gmt":"2025-09-17T05:37:35","guid":{"rendered":"https:\/\/www.enago.com\/academy\/?p=55103"},"modified":"2025-11-06T16:54:06","modified_gmt":"2025-11-06T10:54:06","slug":"ai-powered-peer-review-statistical-rigor","status":"publish","type":"post","link":"https:\/\/www.enago.com\/academy\/ai-powered-peer-review-statistical-rigor\/","title":{"rendered":"Rethinking Statistical Rigor in the Age of AI-Powered Peer Review"},"content":{"rendered":"<h2>Executive Summary<\/h2>\n<p>AI is reshaping <a href=\"https:\/\/www.enago.com\/publication-support-services\/peer-review-process\" data-internallinksmanager029f6b8e52c=\"115\" title=\"Peer Review\" target=\"_blank\" rel=\"noopener\">peer review<\/a>, particularly in statistical evaluation. Beyond text generation, AI can check consistency, rerun analyses, and flag questionable practices, easing reviewer fatigue. Used responsibly with bespoke, secure systems, AI can streamline first-pass checks, allowing reviewers to focus on interpretation, originality, and clinical significance.<\/p>\n<h2>Rethinking Statistical Rigor in the Age of AI-Powered Peer Review<\/h2>\n<p>Despite the widespread use of AI in various aspects of academic publishing by both authors and publishers, its full potential is probably not being realized. Frankly, the potential for generative AI packages based on large language models beyond the generation of text, checking grammar, spelling, the use of English and searching for information is probably unknown by many. For example, some who criticize generative AI for providing false information, hallucinating references and its occasional errors, even for checking spelling and grammar, have no idea that even common packages can be used to analyze and produce spreadsheets, run complex calculations and even perform <a href=\"https:\/\/www.enago.com\/publication-support-services\/statistical-analysis.htm\" data-internallinksmanager029f6b8e52c=\"121\" title=\"Statistical Analysis\" target=\"_blank\" rel=\"noopener\">statistical analysis<\/a>. These facilities of AI lead directly to the possibility of academic publishers using AI as an aid to review the statistical aspects of manuscripts reporting quantitative studies. AI is here, <a href=\"https:\/\/writers-camp.org\/2025\/05\/08\/the-transformative-impact-of-generative-ai-on-academic-writing\/\" target=\"_blank\" rel=\"noopener nofollow\">it is being used and it will not go away<\/a>, we must learn to use it responsibly.<\/p>\n<p>While packages such as ChatGPT should not be used to upload and analyze manuscripts submitted to academic journals for reasons of copyright infringement, I regularly use ChatGPT to analyze published articles by uploading the article and interrogating specific aspects of the reported study. For example, I am especially interested in the effectiveness of COVID-19 vaccines and ChatGPT can be used to calculate <a href=\"https:\/\/www.bmj.com\/content\/381\/bmj-2022-075289\/\" target=\"_blank\" rel=\"noopener nofollow\" class=\"broken_link\">absolute risk reduction and numbers needed to treat<\/a> for the vaccines where such studies commonly only report relative risk reduction and rarely report numbers needed to treat. The package will run the calculation, showing the steps in the calculation, meaning that the results can be checked.<\/p>\n<p>Statistical analysis has evolved rapidly in recent decades and sophisticated commercially available packages, such as SPSS\u00ae and SAS\u00ae, and public domain statistical packages such as <em>R<\/em>, have put the ability to conduct sophisticated and complex statistical analyses in the hands of non-statisticians. This has led to a large volume of studies, translated into manuscripts for publication, being submitted to academic journals. Often the same methods are used repeatedly without much consideration of how appropriate they are and without expert statistical oversight.<\/p>\n<p>As a result, there has been a concomitant increase in the scrutiny of quantitative studies by journal editors, an increase in the standards of statistical analysis required and increased rigor in peer reviewing processes. For example, where the outcomes of randomized clinical trials were traditionally reported using only the statistical significance to justify the effectiveness of interventions, there has been a move in favor of <a href=\"https:\/\/journals.sagepub.com\/doi\/10.1177\/1745691620958012\" target=\"_blank\" rel=\"noopener nofollow\" class=\"broken_link\">reporting effect sizes of differences between control and intervention groups<\/a> along with 95% confidence intervals to provide a better estimation of where the true difference between the groups lies.<\/p>\n<p>In addition to the frequentist statistical methods traditionally applied to clinical trials, there has been some advocacy for <a href=\"https:\/\/pmc.ncbi.nlm.nih.gov\/articles\/PMC9931171\/\" target=\"_blank\" rel=\"noopener nofollow\">the use of Bayesian methods<\/a>. Researchers using Bayesian methods are required to take prior knowledge into account in estimating how effective an intervention may be and then examining the outcome in terms of the expected and actual outcomes, the posterior outcome. They can also estimate the credible interval \u2013 a range of values \u2013 within which the effect of the intervention lies. However, while the absolute number of such studies using Bayesian methods is increasing, <a href=\"https:\/\/www.frontiersin.org\/journals\/pharmacology\/articles\/10.3389\/fphar.2025.1548997\/full\" target=\"_blank\" rel=\"noopener nofollow\">the percentage remains constant and very small<\/a>.<\/p>\n<p>With the volume of manuscripts submitted to academic <a href=\"https:\/\/pmc.ncbi.nlm.nih.gov\/articles\/PMC4317455\/\" target=\"_blank\" rel=\"noopener nofollow\">journals doubling approximately every 15 years<\/a> pre-COVID \u2013 and with the massive increase during COVID \u2013 academic <a href=\"https:\/\/www.nature.com\/articles\/d41586-018-07245-9\" target=\"_blank\" rel=\"noopener nofollow\">publishers and editors have long been turning to AI for solutions<\/a> to help reviewers cope with the burden of various aspects of the review process, including statistical review. Publishers and editors are also concerned with maintaining rapid copy flow and, since peer review is the rate-limiting step in the peer review process, they are always looking for ways to expedite reviews as rapidly as possible. Always, of course, with an eye on the quality of the reviews that are submitted. Academics are busier than ever and, as the number of manuscripts increases, they have less time to review the manuscripts they receive <a href=\"https:\/\/integranxt.com\/blog\/the-growing-burden-addressing-reviewer-fatigue\/\" target=\"_blank\" rel=\"noopener nofollow\">leading to reviewer fatigue<\/a>. This threatens the quality of peer reviewing, including statistical reviewing.<\/p>\n<p>AI has considerable potential to help streamline and accelerate the process of reviewing the statistical aspects of manuscripts. Some examples of where AI could be used include the following automated steps:<\/p>\n<ul>\n<li>Pre-screening: standards check (<a href=\"https:\/\/www.enago.com\/academy\/guestposts\/rogerwatson\/research-reporting-guidelines-evolution-impact-future\/\" target=\"_blank\" rel=\"noopener nofollow\">CONSORT, STROBE, PRISMA<\/a>)<\/li>\n<li><a href=\"https:\/\/trinka.ai\/features\/consistency-check\/\" data-internallinksmanager029f6b8e52c=\"137\" title=\"Consistency Check\" target=\"_blank\" rel=\"noopener\">Consistency Check<\/a>: cross-verify reported numbers with submitted databases<\/li>\n<li>Method Validation: ensure tests match study design<\/li>\n<li>Sample Size and Power: recalculate and confirm<\/li>\n<li>Checking veracity of and re-running submitted statistical coding such as Python\u00ae and <em>R<\/em><\/li>\n<li>Effect Size and Interpretation: statistical versus clinical meaning<\/li>\n<li>Bias and questionable research practices: look for p-hacking, outcome switching<\/li>\n<li>Bayesian statistics: verify whether Bayesian results are interpreted correctly and priors are clearly stated<\/li>\n<li>Draft reviewer notes: structured comments and feedback<\/li>\n<li>Decision support: confidence score and need for statistical reviewer<\/li>\n<\/ul>\n<p>Naturally, there are limitations and caveats to be addressed in the use of AI in the process of statistical reviewing. Commercial packages such as ChatGPT should not be used as the coding and algorithms are not in the public domain and anything which is uploaded to these packages may be shared with and used by others. Therefore, bespoke in-house AI packages are being developed by publishers. These will require training, involving qualified statisticians to ensure that the results they produce are both reliable and valid. In that light, AI should be seen as augmenting the statistical review process, not replacing expert statistical judgment.<\/p>\n<p>Taken together, these questions point to a future where AI is not a replacement for human reviewers but a catalyst for re-thinking how we balance speed, rigor, and transparency. If AI can reliably handle the first pass of statistical checking, flagging problems and re-running code, then editors and reviewers can focus more sharply on interpretation, originality, and clinical relevance. Whether this hybrid model becomes the new normal will depend on how well bespoke, trustworthy AI tools are developed, and how much confidence journals place in them to support\u2014not supplant\u2014the human judgement at the heart of peer review.<\/p>\n<h6><strong>Disclaimer:<\/strong> The opinions\/views expressed in this article exclusively represent the individual perspectives of the author. While we affirm the value of diverse viewpoints and advocate for the freedom of individual expression, we do not endorse derogatory or offensive comments against any caste, creed, race, or similar distinctions. For any concerns or further information, we invite you to contact us at academy@enago.com<\/h6>\n<div style=\"display:flex; gap:10px;justify-content:\" class=\"wps-pgfw-pdf-generate-icon__wrapper-frontend\">\n\t\t<a  href=\"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/55103?action=genpdf&amp;id=55103\" class=\"pgfw-single-pdf-download-button\" ><img data-src=\"https:\/\/www.enago.com\/academy\/wp-content\/plugins\/pdf-generator-for-wp\/admin\/src\/images\/PDF_Tray.svg\" title=\"Generate PDF\" style=\"width:auto; height:45px;\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" class=\"lazyload\"><\/a>\n\t\t<\/div>","protected":false},"excerpt":{"rendered":"<p>Executive Summary AI is reshaping peer review, particularly in statistical evaluation. Beyond text generation, AI&hellip;<\/p>\n","protected":false},"author":12958,"featured_media":55110,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"om_disable_all_campaigns":false,"footnotes":""},"categories":[1926],"tags":[1873],"ppma_author":[1980],"class_list":["post-55103","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-thought-leadership","tag-research-statistics"],"better_featured_image":{"id":55110,"alt_text":"peer review","caption":"","description":"","media_type":"image","media_details":{"width":910,"height":340,"file":"2025\/09\/FeatureImages-3-1.png","filesize":474315,"sizes":{},"image_meta":{"aperture":"0","credit":"","camera":"","caption":"","created_timestamp":"0","copyright":"","focal_length":"0","iso":"0","shutter_speed":"0","title":"","orientation":"0","keywords":[]}},"post":55103,"source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2025\/09\/FeatureImages-3-1.png"},"acf":{"faq_main_heading":"","faq_heading_one":"","faq_heading_two":"","faq_heading_three":"","faq_heading_four":"","faq_heading_five":"","faq_heading_six":"","faq_description_one":"","faq_description_two":"","faq_description_three":"","faq_description_four":"","faq_description_five":"","faq_description_six":""},"views":424,"single_webinar_page_date":null,"single_webinar_page_time":null,"session_agenda":null,"who_should_attend_this_session":null,"about_the_speaker_field":null,"co-webinar-sec":null,"co_webinar_sec_one":null,"speaker-name":null,"webinar-date":null,"webinar-time":null,"webinar-s-image":null,"custum_webinar_category":null,"authors":[{"term_id":1980,"user_id":12958,"is_guest":0,"slug":"rogerw","display_name":"Dr. Roger Watson","avatar_url":{"url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2025\/03\/image-10.png","url2x":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2025\/03\/image-10.png"},"author_category":"1","user_url":"","last_name":"Watson","first_name":"Dr. Roger","job_title":"","description":"Roger Watson is an internationally recognized nursing academic and editor. He is currently Editor-in-Chief of Nurse Education in Practice (since January 2021) and Director of Yould Publications Ltd (since 2005), which provides expert training and advice on writing for publication.\r\n\r\nPreviously, he was Professor of Nursing at the University of Hull (2012\u20132022) and has held faculty positions at the University of Sheffield and University of Western Sydney, among others. His editorial leadership includes serving as Editor-in-Chief of the Journal of Advanced Nursing (2012\u20132020) and Editor-in-Chief of the Journal of Clinical Nursing (2003\u20132011).\r\nRoger Watson holds a PhD in biochemistry from the University of Sheffield and initially trained as a nurse at St George\u2019s Hospital, London, following a biology degree from The University of Edinburgh. His research focuses on the feeding and nutritional issues of older adults with dementia.\r\nHe has served on national research assessment panels in the UK, including the 2008 Research Assessment Exercise and 2014 Research Excellence Framework. He holds honorary positions in Ireland and China and was inducted into the Sigma Theta Tau International Nurse Researcher Hall of Fame in 2017.\r\nHis current work continues to shape nursing education, research, and publication worldwide."}],"_links":{"self":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/55103","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/users\/12958"}],"replies":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/comments?post=55103"}],"version-history":[{"count":4,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/55103\/revisions"}],"predecessor-version":[{"id":56806,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/55103\/revisions\/56806"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media\/55110"}],"wp:attachment":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media?parent=55103"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/categories?post=55103"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/tags?post=55103"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/ppma_author?post=55103"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}