Week 9
The plan for this week was to test gradient boosting on continuous and categorical data and to integrate regression and classification trees on larger datasets.
Day 1:
- Studied the dataset provided by Roger Dev. It had 5000 records, 52 attributes, and 7 classes.
- Developed a script to read this data into ECL and verify the algorithm.
- On a side note, spent the day studying Scopus data as part of my parallel project on connecting legislative and research documents.
Day 3:
- The algorithm took a long time to run on the dataset.
- Spent the rest of the day debugging the problem.
- Fixed the issue.
Day 4:
- Wrote a naive scraper to parse data from Scopus.
- Need to fetch a large amount of data. Planning to do this over the weekend.
Day 5:
- Started working on slides for the Tech Talk on Aug 1st.