Talk:Data preprocessing
| This article is rated Stub-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||
| |||||||||||
Context
Is this just about data pre-processing in a Machine Learning context (apparently the subject of the first reference) or should it also cover data pre-processing in a data mining context (apparently the subject of the second reference)? If it is just about the former, then I have some concerns about notability and self-promotion - given that the article was apparently created by one of the authors of the first reference. --RichardVeryard 03:02, 3 September 2007 (UTC)
- Machine Learning and Data Mining coincide to a very great degree. So, it is about data set preprocessing in both contexts —Preceding unsigned comment added by 194.219.216.110 (talk) 22:54, 7 February 2009 (UTC)
- A very questionable article. The reference to 'Data Preprocessing for Supervised Leaning' is even more questionable. The referred article has a spelling mistake in the title, several spelling and grammar mistakes in the abstract and by no means can be considered an overview or a landmark paper in the area. I think that the whole article should be deleted.
128.189.119.222 (talk) 01:26, 5 June 2010 (UTC)
- I've only just spotted the anonymous reply to my original question. Machine learning and data mining are usually regarded as separate disciplines, and I note that an introduction about data mining has been added since I last looked at the article. However, the article still needs a lot of work. RichardVeryard (talk) 15:11, 8 June 2010 (UTC)
What is data preprocessing?
The articles fails to explain one thing: What data preprocessing actually is. The term should be defined in the very first sentence of the article. —Kri (talk) 17:38, 15 February 2017 (UTC)
Poor writing
The 'Data mining' and 'Semantic data preprocessing' sections are plagued with poor prose and grammar errors - "The reason why a user transforms existing files into a new one is because of many reasons", "Here is the idea [...]", "[...] it make sense [...]", "[...] quantifiers like true positives ,true negatives,False positives and false negatives [...]" (badly-placed commas), "Later it was recognized, that for machine learning [...]"...
SpanishDuke (talk) 23:44, 10 December 2020 (UTC)
Multiple Changes
Hi all, I've just made multiple changes to the Data pre-processing (now Data Preprocessing, see Move talk page) in an attempt to improve the article. These have been separated out in order to make it easier to see the reasoning behind each change: happy to discuss if there are any issues but please avoid reverting all changes if possible. These changes include grammar and capitalisation fixes, the removal of duplicate or unnecessary information, and some reformatting. I've also added some tags for issues that should be addressed in the future. EditorOnOccasion (talk) 11:43, 7 August 2023 (UTC)
"Corrected a link"
I have made a change stating that I corrected a link but I want to be sure AhmedZeedy45 (talk) 08:39, 23 September 2024 (UTC)
Wiki Education assignment: Introduction to Technical Writing
This article was the subject of a Wiki Education Foundation-supported course assignment, between 19 August 2025 and 13 December 2025. Further details are available on the course page. Student editor(s): Atang-official (article contribs).
— Assignment last updated by Atang-official (talk) 17:52, 23 September 2025 (UTC)
Content Disclaimer
Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.
- The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
- There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
- It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
- Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
- Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.