CSE4063 Project #2- Frequent Pattern Mining and Clustering Solved

30.00 $

Category:

Description

Rate this product

 

No Group Members Dataset [Rows x Columns] Presentation Slot
1 Ayberk Ömer Altuntabak *

Abdulhalik Şensin

Amela Karmaj

Absenteeism at work [740×21] 14.01.2021

Thursday 12.00 – 12.17

2 Emin Kağan Kadıoğlu *

Ayşenur Yılmaz

Mert Mengü

Anuran Calls (MFCCs) [7,195×22] 14.01.2021

Thursday 12.20 – 12.37

3 Mehmet Nusret Odabaşı *

Abbas Kutay

Orhan Fatih Bayazıt

Apartment for rent classified [10,000×22] 14.01.2021

Thursday 12.40 – 12.57

4 Ahmet Enes Gündüz *

Hakan Yalçın

Muhammed Fethullah Eroğlu

BLE RSSI Dataset for Indoor localization and Navigation [6,611×15] 14.01.2021

Thursday 13.00 – 13.17

5 Furkan Akman *

Burak Fidan

Mustafa Sertaç Öztürk

Codon usage [13,028×69] 14.01.2021

Thursday 13.20 – 13.37

6 Ferihan Çabuk *

Ali Berat Çetin

Muhammed İsa Akbaba

Estimation of obesity levels based on eating habits and physical condition [2,111×17] 14.01.2021

Thursday 13.40 – 13.57

7 Halid Seyfullah Sert *

Mert İlik

 

Facebook Live Sellers in Thailand [7,051×12] 19.01.2021

Tuesday 12.00 – 12.17

8 Sedanur Kara *

Berke Şahin

Sinem Onal

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone [153,540×25] 19.01.2021

Tuesday 12.20 – 12.37

9 Diala Jassem M.B.J. *

Münevver Sueda Kocatürk

Nurhande Akyüz

Mice Protein Expression [1,080×82] 19.01.2021

Tuesday 12.40 – 12.57

10 Ahmet Hakan Ekşi *

Belgin Taştan

Kevser İldeş

Motion Capture Hand Postures [78,095×38] 16.01.2021

Saturday 12.00 – 12.17

11 Deniz Arda Gürhizin *

Can Berk Durmuş

Tarkan Batar

Online Shoppers Purchasing Intention Dataset [12,330×18] 16.01.2021

Saturday 12.20 – 12.37

12 Zahide Gür Taştan *

Merve Ayer

Zeynep Naz Akyokuş

Sales_Transactions_Dataset_Weekly [811×53] 16.01.2021

Saturday 12.40 – 12.57

13 Cem Güleç *

Buğra Akdeniz

Kadir Hızarcı

Shill Bidding Dataset [6,321×13] 16.01.2021

Saturday 13.00 – 13.17

14 İlker Fener *

Doğukan Deniz

Halil İbrahim Şimşek

South German Credit (UPDATE) [1,000×21] 16.01.2021

Saturday 13.20 – 13.37

15 Osman Mantıcı *

Buse Batman

Fatmanur Özdemir

Turkiye Student Evaluation [5,820×33] 16.01.2021

Saturday 13.40 – 13.57

  1. The first students indicated by * sign are the group representatives.
  2. Learn / Get information about your data.
  • Python Platform & Environment
    1. Get a platform/environment for python work on, if you do not have any. Install it on your computer.
    2. You may use any libraries you want; however, you should have complete understanding to use and explain it in demo sessions.
    3. Implement your work with your own code as possible as you can.
  • Model Construction: Frequent Pattern Mining & Clustering Analysis
    1. Do the data preprocessing steps, if required.
    2. Use your dataset to construct 3 frequent pattern mining models as follows: i) ii) FP-Growth.
    3. Use your dataset to construct 3 clustering analysis methods as follows:
    4. i) K-Means. ii)
  • Implementation & Model Evaluation
    1. Implement six algorithms above on your dataset using python.
    2. Compare the performance and the results of three frequent pattern mining algorithms on your dataset. Discuss the results.
    3. Compare the performance and the results of three clustering analysis algorithms on your dataset. Discuss the results.
    4. Compare the performances of your classifiers with performances of the relevant papers given on the site.
  • Presentation
    1. You are going to present your work done online in 12 minutes at the time slot reserved for your group. Group members should equally participate the presentation. See the table above.
    2. Prepare a presentation file discussing the details of your work done and results of the classifiers.
    3. Your presentation should contain the following parts at least:
    4. i) Problem definition ii) Dataset
      • Information about the dataset.
      • Number of instances, columns, etc.
    5. Data preprocessing, cleaning
      • Missing values, and how you conduct on these.
      • Transformations and normalizations.
    6. Python implementation for each of the 6 algorithms
      • IDE/environment used.
      • Implementation details.
      • Libraries used.
    7. Model evaluation & performance results
      • Performance measures.
      • Comparison of all 6 algorithms.
  • Conclusion
  • Demo with Presentation
    1. You are going to demonstrate your work done online in 5 minutes after your presentation. See the table above.
    2. You are going to have 17 minutes in total for your group’s session (12 minutes for presentation, and 5 minutes for demonstration).
    3. Please keep in mind that all the presentation and demo sessions will be recorded.
    4. All the students should attend all sessions.
  • Related Questions & Answers
    1. Prepare 5 questions and answers related to your topic. These questions may be asked to other students.
    2. Question types can be multiple choice (single or multiple selection), fill in the blanks, matching, essay, etc.
    3. Prepare a presentation file with 11 slides consisting these 5 questions and answers. First slide will be used for your topic and group members’ info. Use 1 slide per each question, and 1 slide per each answer.
  • Evaluation
    1. Your grade related to project #2 will cover 10% of your total grade at least; may increase subject to coronavirus issues.
    2. Evaluation will be done out of 100 points:
    3. i) [4 pts] Data set understanding. ii) [4 pts] Data preprocessing. iii) [20 pts] Implementation of frequent pattern algorithms. iv) [20 pts] Implementation of clustering analysis algorithms. v) [14 pts] Results, comparison, discussion & conclusion.
    4. vi) [20 pts] Presentation quality. vii) [8 pts] Demo quality. viii) [10 pts] Questions & answers quality.
  • Submission
    1. You are going to submit the followings:
      1. Python codes implemented.
      2. Presentation file.
  • Questions & answers presentation file.
  1. Write the following sentence in a text file: “We hereby swear that the work done on this project is totally our own; and on our honor, we have neither given nor received any unauthorized and/or inappropriate assistance for this project. We understand that by the school code, violation of these principles will lead to a zero grade and is subject to harsh discipline issues.” Rename it as “we_swear.txt” and include this file in the zip submission file.
  2. Only one of the group members (i.e. group representative, in short “GrRep”) is going to submit the project using GrRep’s info all the time. However, all group members should have a complete and comprehensive understanding of all the work done for all tasks and steps of the project.
  3. Zip all your documents into a single file using filename GrRepStudentNumber_P2.zip (e.g. 150118123_P2.zip) and submit it to the site http://ues.marmara.edu.tr before deadline.
  4. In case of any form of copying and cheating on solutions, all parts will get ZERO points. You should submit your own work. In case of any forms of cheating or copying, both giver and receiver are equally culpable and suffer equal penalties. All types of plagiarism will result in zero points from the homework.
  5. If case of using your handwriting, your handwriting should be readable, clear and neat. If possible, do not use any handwriting.
  6. Do not send project submissions through e-mail. E-mail attachments will not be accepted as valid submissions.
  7. You are responsible for making sure you are turning in the right file, and that it is not corrupted in anyway. We will not allow resubmissions if you turn in the wrong file, even if you can prove that you have not modified the file after the deadline.
  8. Grade evaluation may be done on selected parts of the project, so try to complete all parts of your project successfully.
  9. No late submissions will be accepted.

 

  • Frequent-Pattern-Mining-and-Clustering-6qpis1.zip