Personal blog written from scratch using Node.js, Bootstrap, and MySQL. https://jrtechs.net
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

95 lines
4.3 KiB

  1. # Importing and Cleaning Data
  2. # Data Visualization
  3. # Analysis
  4. # Report
  5. ## Abstract
  6. The way in which a team trains is critical in ensuring that everyone performs at their peak performance
  7. during a game. In order to effectively train a team to optimize their gameday performance, it would make
  8. intuitive sense to monitor their training data with respect to their perceived fatigue. Through analyzing
  9. time series data provided by our partnering women’s rugby team, it was observed that this team altered
  10. their training schedule close to games. Although there is some relationship between the two in the long
  11. run, our attempts at modeling fatigue and work load in the short run suggests little to no correlation using
  12. linear regressions. This suggests that modeling fatigue is a more complex problem including a slew of factors
  13. both psychological and physical which spans over a period of time; coaches should pay attention not only to
  14. training but also sleep and mental wellness for happy and competitive teams. To most effectively forecast an
  15. individual’s performance during a game, we propose a system which takes into account physiological factors
  16. such as desire and physical factors such as sleep, soreness and amount of training.
  17. ## Methodology
  18. We employed a wide range of techniques for establishing our models and hypotheses, including smoothing
  19. of time series Information, testing of hypotheses based on a prior understanding of the domain, plotting
  20. and visually analyzing pairs of variables, and artificial intelligence algorithms that found various linear and
  21. nonlinear patterns in the dataset. Coefficients of determination were calculated to determine fitness of linear
  22. models, and F1 scores were analyzed to validate complex nonlinear classification models.
  23. ## Modeling Fatigue
  24. Fatigue can be effectively and linearly modeled using daily records and time series moving
  25. averages of acute chronic ratios, daily workload, sleep quality, and sleep hours.
  26. This means that instead of only lowering training before competitions, coaches
  27. should put focus on preparing the athletes physically and mentally through a
  28. combination of measures with a focus on sleep.
  29. | Iterations/100 | Mean Squared Error |
  30. | ----------- | ----------- |
  31. | 1 | 90.4998 |
  32. | 11 | 1.0265 |
  33. | 21 | 0.9604 |
  34. | 31 | 0.8671 |
  35. | 41 | 0.7838 |
  36. |100 | 0.0925 |
  37. Sample Size: 304864
  38. Final R2: 0.532
  39. ## Predicting Performance
  40. Trivially, performance of an individual cannot be modeled using simple linear regressions
  41. only involving one factors. We therefore developed and optimized a deep neural
  42. network to capture the patterns involving fatigue, sleep, and self-rated performance.
  43. The structure of the network is a 3-layer (input, output, and a hidden layer)
  44. sigmoid classifier that was trained on batches of 32 samples from players with
  45. respect to features: normalized perceived fatigue, sliding average of
  46. perceived fatigue, sliding average over sleep hours, and the perceived sleep quality of
  47. the players. It is optimized through the Adam optimizer with a learning rate of
  48. .005 and cross entropy to calculate the loss between the logits and labels.
  49. The logits of the work are a confidence output on which class the network
  50. feels the sample most likely belongs to, the real value of which is the
  51. classification of perceived performance by the player. Through this method,
  52. we can show a correlation between fatigue, sleep, and self-rated performance,
  53. as well as a means to predict this self-rate performance based off of fatigue
  54. and self-perceived sleep quality.
  55. Results with LR=.01, Batch=32:
  56. - Accuracy before training: 20.44388%
  57. - Loss after step 49: .531657
  58. - Accuracy after training: 74.846625%
  59. - F1 Score: .94
  60. ![](media/datafest/network.png)
  61. ## Future Work
  62. With more data to to test with we can further improve and validate out models. With historical data from
  63. other teams we can take our analysis one step further. Based on the training, performance, and fatigue
  64. information from other teams we can use that to create a model to make a recommendation for our team’s
  65. training. This model would be able to make recommendations for our training intensity leading up to a
  66. game. Since this will be heavily dealing with multivariate time series data leading up to a game, using a Long
  67. Short-term Network (LSTM) would bring promising results.