{"id":1259,"date":"2016-06-21T12:10:14","date_gmt":"2016-06-21T06:40:14","guid":{"rendered":"https:\/\/www.enago.com\/academy\/?p=1259"},"modified":"2018-05-21T18:36:30","modified_gmt":"2018-05-21T13:06:30","slug":"data-massaging-in-scientific-research","status":"publish","type":"post","link":"https:\/\/www.enago.com\/academy\/data-massaging-in-scientific-research\/","title":{"rendered":"Data Massaging in Scientific Research: When Does It Go Too Far?"},"content":{"rendered":"<p>One of the joys of research is <a href=\"https:\/\/www.enago.com\/academy\/what-is-big-data-are-we-better-off-with-it\/\" target=\"_blank\" rel=\"noopener\">feeding a mass of data into a computer program<\/a>, pushing the return button, and seeing a graph magically appear. A straight line! That ought to give some nice kinetic data. But on second glance the plot is not quite satisfactory. There are some annoying outlying points that skew the rate constant away from where it ought to be. No problem. This is just a plot of the raw data. Time to clean it up. Let\u2019s exclude all data that falls outside the three sigma range. There, that helped. Tightened that error bar and moved the constant closer to where it should be. Let\u2019s try a two sigma filter. Even better! Now that\u2019s some data that\u2019s publishable.<\/p>\n<p>You have just engaged in the venerable practice of data massaging. A common practice, but should it be?<\/p>\n<p>Every scientist will agree that you should not choose data\u2014selecting data that supports your argument and ignoring data that does not. But even here there are some grey areas. Not every reaction system gives clean kinetics. Is there anything wrong with <a href=\"https:\/\/www.enago.com\/academy\/does-it-matter-when-you-analyze-yourour-research-data\/\" target=\"_blank\" rel=\"noopener\">studying a system that can be analyzed<\/a>, rather than beating your head against the wall of an intractable system? Gregor Mendel didn\u2019t think so. In his studies of plant heredity, he did not randomly sample data from every plant in his garden. He found that some plants gave easily analyzed data while\u00a0others did not. Naturally, he studied those that gave results that made sense. But among those systems he studied, he did not pick and choose his data. Even some of the best scientists will apply what they consider rigorous statistical filters to improve the data, to clean it up, to tighten the error bars. Is this acceptable?<\/p>\n<p>Some statisticians say it is not. They argue that no data should be excluded on the basis of statistics. Statistics may point out which data should be further scrutinized but no data should be excluded on the basis of statistics. I agree with this point of view. When you \u201cimprove\u201d data, you exclude data. Should not all the data be available to the public? If there is a wide spread in the data, is not that fact in itself a valuable piece of information? A reader ought to know how reliable the data is and not have to guess how good it was before the two sigma filter was applied.<\/p>\n<p>Is data massaging unethical? Not if you clearly state what you have done. But the practice is unwise and ought to be discouraged.<\/p>\n<h3>Famous Example of Data Massaging<\/h3>\n<p>Robert Millikan\u2019s famous 1909 oil drop experiment measured the value of the elementary charge of an electron to within 0.5%. Or did it? Some historians claim that before he \u201ccleaned up\u201d his data, his standard error was 2%, four times as high. <a href=\"http:\/\/en.wikipedia.org\/wiki\/Oil_drop_experiment#Fraud_allegations\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/en.wikipedia.org\/wiki\/Oil_drop_experiment#Fraud_allegations<\/a><\/p>\n<div style=\"display:flex; gap:10px;justify-content:\" class=\"wps-pgfw-pdf-generate-icon__wrapper-frontend\">\n\t\t<a  href=\"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/1259?action=genpdf&amp;id=1259\" class=\"pgfw-single-pdf-download-button\" ><img data-src=\"https:\/\/www.enago.com\/academy\/wp-content\/plugins\/pdf-generator-for-wp\/admin\/src\/images\/PDF_Tray.svg\" title=\"Generate PDF\" style=\"width:auto; height:45px;\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" class=\"lazyload\"><\/a>\n\t\t<\/div>","protected":false},"excerpt":{"rendered":"<p>One of the joys of research is feeding a mass of data into a computer&hellip;<\/p>\n","protected":false},"author":4,"featured_media":1282,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"om_disable_all_campaigns":false,"footnotes":""},"categories":[747,2],"tags":[1452],"ppma_author":[1895],"class_list":["post-1259","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-content-structure","category-academic-writing","tag-manuscript-drafting-tips"],"better_featured_image":{"id":1282,"alt_text":"Data Massaging","caption":"","description":"","media_type":"image","media_details":{"width":930,"height":300,"hwstring_small":"height='41' width='128'","file":"2016\/05\/Data-Massaging-When-does-it-Go-Too-Far-.jpg","sizes":{"thumbnail":{"file":"Data-Massaging-When-does-it-Go-Too-Far--170x150.jpg","width":170,"height":150,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--170x150.jpg"},"medium":{"file":"Data-Massaging-When-does-it-Go-Too-Far--470x152.jpg","width":470,"height":152,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--470x152.jpg"},"medium_large":{"file":"Data-Massaging-When-does-it-Go-Too-Far--768x248.jpg","width":768,"height":248,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--768x248.jpg"},"better-amp-small":{"file":"Data-Massaging-When-does-it-Go-Too-Far--100x100.jpg","width":100,"height":100,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--100x100.jpg"},"better-amp-normal":{"file":"Data-Massaging-When-does-it-Go-Too-Far--260x200.jpg","width":260,"height":200,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--260x200.jpg"},"better-amp-large":{"file":"Data-Massaging-When-does-it-Go-Too-Far--450x300.jpg","width":450,"height":300,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--450x300.jpg"},"publisher-tb1":{"file":"Data-Massaging-When-does-it-Go-Too-Far--86x64.jpg","width":86,"height":64,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--86x64.jpg"},"publisher-sm":{"file":"Data-Massaging-When-does-it-Go-Too-Far--210x136.jpg","width":210,"height":136,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--210x136.jpg"},"publisher-mg2":{"file":"Data-Massaging-When-does-it-Go-Too-Far--279x220.jpg","width":279,"height":220,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--279x220.jpg"},"publisher-md":{"file":"Data-Massaging-When-does-it-Go-Too-Far--357x210.jpg","width":357,"height":210,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--357x210.jpg"},"publisher-lg":{"file":"Data-Massaging-When-does-it-Go-Too-Far--750x300.jpg","width":750,"height":300,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--750x300.jpg"},"publisher-tall-sm":{"file":"Data-Massaging-When-does-it-Go-Too-Far--180x217.jpg","width":180,"height":217,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--180x217.jpg"},"publisher-tall-lg":{"file":"Data-Massaging-When-does-it-Go-Too-Far--267x300.jpg","width":267,"height":300,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--267x300.jpg"},"publisher-tall-big":{"file":"Data-Massaging-When-does-it-Go-Too-Far--368x300.jpg","width":368,"height":300,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far--368x300.jpg"}},"image_meta":{"aperture":"0","credit":"","camera":"","caption":"","created_timestamp":"0","copyright":"","focal_length":"0","iso":"0","shutter_speed":"0","title":"","orientation":"0","keywords":[]}},"post":1259,"source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2016\/05\/Data-Massaging-When-does-it-Go-Too-Far-.jpg"},"acf":{"faq_main_heading":"","faq_heading_one":"","faq_heading_two":"","faq_heading_three":"","faq_heading_four":"","faq_heading_five":"","faq_heading_six":"","faq_description_one":"","faq_description_two":"","faq_description_three":"","faq_description_four":"","faq_description_five":"","faq_description_six":""},"views":1006,"single_webinar_page_date":null,"single_webinar_page_time":null,"session_agenda":null,"who_should_attend_this_session":null,"about_the_speaker_field":null,"co-webinar-sec":null,"co_webinar_sec_one":null,"speaker-name":null,"webinar-date":null,"webinar-time":null,"webinar-s-image":null,"custum_webinar_category":null,"authors":[{"term_id":1895,"user_id":4,"is_guest":0,"slug":"editor","display_name":"Enago Academy","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/2ef4bc47f3ceaa56f5eb3b26f9520fad298ba36ede4f86315997ffb45db37a1f?s=96&d=identicon&r=g","author_category":"","user_url":"","last_name":"Academy","first_name":"Editor","job_title":"","description":"Enago Academy, the knowledge arm of Enago, offers comprehensive and up-to-date resources on academic research and scholarly publishing to all levels of scholarly professionals: students, researchers, editors, publishers, and academic societies. It is also a popular platform for networking, allowing researchers to learn, share, and discuss their experiences within their network and community. The team, which comprises subject matter experts, academicians, trainers, and technical project managers, are passionate about helping researchers at all levels establish a successful career, both within and outside academia."}],"_links":{"self":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/1259","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/comments?post=1259"}],"version-history":[{"count":0,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/1259\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media\/1282"}],"wp:attachment":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media?parent=1259"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/categories?post=1259"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/tags?post=1259"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/ppma_author?post=1259"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}