need for data imputation #3

dataknut · 2019-06-28T07:04:54Z

We currently only have observations for when EVs are being driven or are charging. This means there is a lot of 'missing' data when it comes to calculating 'population' average kW charging demand etc. We need to impute the missing 1 minute observations (giving them 0 kW charging) and re-calculate the sample means. Obviously we can't impute state_of_charge in the same way, although we could assume some decay curve between the last and next real observation?
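A minimal sketch of the zero-imputation idea, in pandas purely for illustration. The column names (`charge_kw`, `state_of_charge`) and the sample values are hypothetical, not taken from the real data. Linear interpolation in time stands in for the "decay curve" here, which is an assumption and may well be the wrong shape.

```python
import pandas as pd

# Hypothetical observations for one vehicle: rows exist only while charging.
obs = pd.DataFrame(
    {
        "charge_kw": [3.0, 3.0, 7.0],
        "state_of_charge": [40.0, 41.0, 80.0],
    },
    index=pd.to_datetime(
        ["2019-01-01 00:00", "2019-01-01 00:01", "2019-01-01 00:09"]
    ),
)

# Regular 1-minute grid spanning this vehicle's observation period.
grid = pd.date_range(obs.index.min(), obs.index.max(), freq="1min")
full = obs.reindex(grid)

# Assumption: a missing minute means no charging, so impute 0 kW.
full["charge_kw"] = full["charge_kw"].fillna(0.0)

# Assumption: state of charge changes linearly in time between real
# observations (a simple stand-in for a proper decay curve).
full["state_of_charge"] = full["state_of_charge"].interpolate(method="time")

observed_mean = obs["charge_kw"].mean()  # mean over observed minutes only
imputed_mean = full["charge_kw"].mean()  # mean over every minute in the period

print(f"observed-only mean: {observed_mean:.2f} kW")
print(f"zero-imputed mean:  {imputed_mean:.2f} kW")
```

The gap between the two means shows how much the observed-only figure overstates average demand when most minutes have no record.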


dataknut · 2019-06-28T07:06:56Z

@raffertyparker comments: "To accurately analyse time-averaged data:

Many opportunities for further research involve using mean values over time. What we currently have is mean values of the instances during which data was being collected, but it appears that for most of the time when the vehicles were neither driving nor charging, no data was collected. This makes the time-averaged plots both messy and practically useless. As an example, in the plots of daily charging demand (deleted from main report) no weighting is given to the fact that charging is occurring more frequently at certain times than others.

The data is quite sporadic (data from different vehicles have different start dates and times, and data is not sent at consistent time intervals). As far as I understand, in order to get true mean values over time we need to consider the zero values where no charging or driving was occurring. For this, I presume we need to create an "empty" dataframe for each individual vehicle that consists of a datetime column, and then columns of zeros for each variable we want to find the mean of, running the duration of each vehicle's data collection period at predetermined intervals (15 mins should be sufficient). To this we would "add" the original data that falls within each time interval, allowing true mean values to be found from the new dataframe. This would assume no missing non-zero data during the dates between which each vehicle has data collected."
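The per-vehicle grid described above could be sketched roughly as follows. This is a pandas illustration only: `vehicle_id`, `timestamp`, `charge_kw` and the helper `zero_filled_grid` are hypothetical names, and the sample rows are made up. It bins each vehicle's observations into 15 minute intervals over that vehicle's own collection period, treats empty intervals as 0 kW, and then averages by time of day across vehicles. It also assumes that the mean of the observations inside an interval represents the whole interval, which overstates demand if charging covered only part of it.

```python
import pandas as pd

def zero_filled_grid(vehicle_df, freq="15min"):
    """Regular grid over one vehicle's collection period; empty intervals become 0."""
    # Mean of the observations falling inside each interval (NaN where none exist).
    binned = vehicle_df["charge_kw"].resample(freq).mean()
    # Assumption: an interval with no data means no charging took place.
    return binned.fillna(0.0)

# Hypothetical sparse observations for two vehicles with different periods.
raw = pd.DataFrame(
    {
        "vehicle_id": ["a", "a", "b"],
        "timestamp": pd.to_datetime(
            ["2019-01-01 00:05", "2019-01-01 01:10", "2019-01-02 00:20"]
        ),
        "charge_kw": [3.0, 6.0, 9.0],
    }
)

# One zero-filled series per vehicle, each spanning only that vehicle's period.
grids = {
    vid: zero_filled_grid(group.set_index("timestamp"))
    for vid, group in raw.groupby("vehicle_id")
}

# Stack all vehicles and average by time of day to get a demand profile.
combined = pd.concat(grids.values())
profile = combined.groupby(combined.index.time).mean()

print(grids["a"])
print(profile)
```

Because each grid only runs between a vehicle's first and last record, vehicles with short collection periods contribute fewer intervals to the profile, so no zeros are invented outside the period for which a vehicle actually has data.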

dataknut added the dataFIx label Jun 28, 2019

dataknut self-assigned this Jun 28, 2019

dataknut referenced this issue Jun 28, 2019

substantial update & re-run of report v1 including new ways to look a…

8da32c4

…t charge levels (power demand) and charging times; corrected interp of 0 charging & renamed 'fast' to 'rapid' throughout (can't remember why)
