Special Season (Época Especial)
Projects may be done individually or in groups of at most 3 students. Each student or group must deliver a final report through Fenix.
- Project description (posted July 10th)
- Report template
- First dataset
- Second dataset
Normal Season (Época Normal)
Projects must be done in groups of 3 students, and each group must deliver a final report through Fenix.
- Project description (posted Sep 21st)
- Report template
- First dataset
- Second dataset
FAQ
Please always consult the FAQ before posting questions to the faculty hosts.
1. Is there any preference on the template to use (LaTeX vs Word)? No. Students only have to deliver a PDF file with their report, and neither option is preferred. As usual, it is strongly recommended that students use whichever is easiest for them.
2. What about report structure? The report should follow the ACM two-column template available above and should present the work done in each discovery step, from data exploration to evaluation and critical analysis. Both datasets should be covered in parallel.
3. How do we deal with missing values in scikit-learn? Most scikit-learn estimators do not accept missing values, so performance cannot be measured on the data as-is. To obtain a baseline, replace the missing entries with a new value (such as UNKNOWN or NA) and record the performance achieved. Note that the same transformation must be applied to the test dataset.
4. What about creativity? This kind of project requires a systematic analysis of the data, exploring different pre-processing techniques and parameterisations of the learning methods, which is not a very creative task. However, during that exploration it is usual to find a subproblem that could benefit from a different approach, involving some combination of techniques or a particular manipulation. We are expecting a somewhat "out-of-the-box" solution of this kind. Simply exploring one more learning technique will not be rewarded.
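
For question 3 above, the missing-value baseline could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the toy arrays, the single categorical feature, and the choice of DecisionTreeClassifier are hypothetical and are not taken from the project description or the datasets.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data: one categorical feature, np.nan marks a missing entry.
train_X = np.array([["red"], [np.nan], ["blue"], ["red"], [np.nan], ["blue"]],
                   dtype=object)
train_y = [1, 0, 0, 1, 0, 0]
test_X = np.array([[np.nan], ["red"], ["blue"]], dtype=object)
test_y = [0, 1, 0]

# Baseline: treat "missing" as its own category by filling with a new value,
# then one-hot encode and fit a classifier (the classifier is an assumption).
baseline = make_pipeline(
    SimpleImputer(strategy="constant", fill_value="UNKNOWN"),
    OneHotEncoder(handle_unknown="ignore"),
    DecisionTreeClassifier(random_state=0),
)
baseline.fit(train_X, train_y)

# Scoring through the pipeline applies the same imputation to the test set.
baseline_accuracy = baseline.score(test_X, test_y)

# The first pipeline step alone shows what the imputed test data looks like.
imputed_test = baseline[0].transform(test_X)
```

Wrapping the imputer in a pipeline is one way to make sure the test dataset goes through exactly the same transformation as the training dataset. The stored `baseline_accuracy` can then be compared against results obtained with other missing-value strategies.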