Personal blog written from scratch using Node.js, Bootstrap, and MySQL. https://jrtechs.net
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

115 lines
4.4 KiB

  1. https://rpubs.com/patrickoster/481524
  2. https://rpubs.com/patrickoster/481544
  3. # Importing and Cleaning Data
  4. # Data Visualization
  5. # Analysis
  6. # Report
  7. ## Abstract
  8. The way in which a team trains is critical in ensuring that everyone
  9. performs at their peak performance during a game. In order to
  10. effectively train a team to optimize their gameday performance, it
  11. would make intuitive sense to monitor their training data with respect
  12. to their perceived fatigue. Through analyzing time series data
  13. provided by our partnering women’s rugby team, it was observed that
  14. this team altered their training schedule close to games. Although
  15. there is some relationship between the two in the long run, our
  16. attempts at modeling fatigue and work load in the short run suggests
  17. little to no correlation using linear regressions. This suggests that
  18. modeling fatigue is a more complex problem including a slew of factors
  19. both psychological and physical which spans over a period of time;
  20. coaches should pay attention not only to training but also sleep and
  21. mental wellness for happy and competitive teams. To most effectively
  22. forecast an individual’s performance during a game, we propose a
  23. system which takes into account physiological factors such as desire
  24. and physical factors such as sleep, soreness and amount of training.
  25. ## Methodology
  26. We employed a wide range of techniques for establishing our models and
  27. hypotheses, including smoothing of time series Information, testing of
  28. hypotheses based on a prior understanding of the domain, plotting and
  29. visually analyzing pairs of variables, and artificial intelligence
  30. algorithms that found various linear and nonlinear patterns in the
  31. dataset. Coefficients of determination were calculated to determine
  32. fitness of linear models, and F1 scores were analyzed to validate
  33. complex nonlinear classification models.
  34. ## Modeling Fatigue
  35. Fatigue can be effectively and linearly modeled using daily records
  36. and time series moving averages of acute chronic ratios, daily
  37. workload, sleep quality, and sleep hours. This means that instead of
  38. only lowering training before competitions, coaches should put focus
  39. on preparing the athletes physically and mentally through a
  40. combination of measures with a focus on sleep.
  41. | Iterations/100 | Mean Squared Error |
  42. | ----------- | ----------- |
  43. | 1 | 90.4998 |
  44. | 11 | 1.0265 |
  45. | 21 | 0.9604 |
  46. | 31 | 0.8671 |
  47. | 41 | 0.7838 |
  48. |100 | 0.0925 |
  49. Sample Size: 304864
  50. Final R2: 0.532
  51. ## Predicting Performance
  52. Trivially, performance of an individual cannot be modeled using simple
  53. linear regressions only involving one factors. We therefore developed
  54. and optimized a deep neural network to capture the patterns involving
  55. fatigue, sleep, and self-rated performance.
  56. The structure of the network is a 3-layer (input, output, and a hidden
  57. layer) sigmoid classifier that was trained on batches of 32 samples
  58. from players with respect to features: normalized perceived fatigue,
  59. sliding average of perceived fatigue, sliding average over sleep
  60. hours, and the perceived sleep quality of the players. It is optimized
  61. through the Adam optimizer with a learning rate of .005 and cross
  62. entropy to calculate the loss between the logits and labels.
  63. The logits of the work are a confidence output on which class the
  64. network feels the sample most likely belongs to, the real value of
  65. which is the classification of perceived performance by the player.
  66. Through this method, we can show a correlation between fatigue, sleep,
  67. and self-rated performance, as well as a means to predict this
  68. self-rate performance based off of fatigue and self-perceived sleep
  69. quality.
  70. Results with LR=.01, Batch=32:
  71. - Accuracy before training: 20.44388%
  72. - Loss after step 49: .531657
  73. - Accuracy after training: 74.846625%
  74. - F1 Score: .94
  75. ![](media/datafest/network.png)
  76. ## Future Work
  77. With more data to to test with we can further improve and validate out
  78. models. With historical data from other teams we can take our analysis
  79. one step further. Based on the training, performance, and fatigue
  80. information from other teams we can use that to create a model to make
  81. a recommendation for our team’s training. This model would be able to
  82. make recommendations for our training intensity leading up to a game.
  83. Since this will be heavily dealing with multivariate time series data
  84. leading up to a game, using a Long Short-term Network (LSTM) would
  85. bring promising results.