-
If my time series has more than one value per time step, say a geolocation (i.e., lat and long) over time, can you provide an example of how to use STUMPY to analyze the time series signal?
-
It depends on what you are trying to accomplish. The first thing you could try is to compute the matrix profile for each variable independently from the other using Alternatively, one could compute what is called a "multi-dimensional matrix profile" using What have you tried already? What are you trying to accomplish?
-
Additionally, if you haven't had a chance to look through them, you may find some useful ideas in our tutorials.
-
An example for what I am trying to accomplish is analyzing a bus route. A bus takes the same route daily so there is periodicity to the data (lat/lon of the bus tracked over time every day). I'd like to identify when the bus "deviated" from its route. This could be getting stuck (say in traffic) or perhaps even taking an alternate route (due to detour). The data I have is a time series data of lat/lon over a whole year. Hope that helps explain what I am trying to accomplish. In this vein, maybe each variable could be treated independently? I would think a delay does affect both variables, same with taking a detour. So ideally perhaps this is multi-variate analysis and really suited for Eventually though I suspect I need Thank you again for the guidance.
-
This is a pretty interesting example! So, technically, In your case, you feed in a 2D time series ( I hope that helps.
-
Btw, is this bus data publicly available? If so, I'd like to take a look at it to see if it could serve as an example in our tutorial.
-
Ok, I'll take a look at mstump. Is there a tutorial for it I can use as guidance? P.S. The bus data I have access to seems to come from a few sources and not a single public source. If I can share any of my data, I will let you know.
-
There is this tutorial that is incomplete and still evolving and so it might generate more questions than answers. I still recommend reading the original paper mentioned above.
-
Technically, STUMPY (or matrix profiles in general) does not actually take "time" into consideration when computing a matrix profile. Instead, it processes an ordered sequence or series of data. At least, that's my mental model. You are correct in that STUMPY assumes that your data points are (roughly) equally spaced, and we do not provide any tools for that pre-processing work as there are many other specialized tools that can help with that (e.g., Pandas and NumPy are great). When you have missing data points, you may consider setting those values to
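To make the pre-processing step concrete, here is a minimal Pandas sketch that puts irregular GPS pings onto an evenly spaced grid. The timestamps, coordinates, and one-minute grid are all hypothetical, and using `np.nan` as the placeholder for missing points is my assumption, since the value named in the reply above did not survive.

```python
import numpy as np
import pandas as pd

# Hypothetical, irregularly sampled GPS pings (all values are made up)
raw = pd.DataFrame(
    {
        "lat": [40.00, 40.01, 40.02, 40.05, 40.06],
        "lon": [-75.00, -75.01, -75.02, -75.05, -75.06],
    },
    index=pd.to_datetime(
        [
            "2021-01-01 08:00:05",
            "2021-01-01 08:01:02",
            "2021-01-01 08:02:10",
            "2021-01-01 08:05:01",
            "2021-01-01 08:06:03",
        ]
    ),
)

# Resample onto an evenly spaced 1-minute grid.
# Minutes without any ping (08:03 and 08:04 here) come out as NaN.
regular = raw.resample("1min").mean()

# Shape (n_dimensions, n_timesteps): one row per variable
T = regular.to_numpy().T

print(regular)
```

Whether to leave the gaps as `NaN` or to interpolate them (for example with `regular.interpolate()`) depends on how long the gaps are and on what counts as a "deviation" in your data.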
-
ICYMI: The MSTUMP tutorial has been published.
-
One question @Darveesh: why don't you rasterize the geolocations with something such as H3 (https://github.com/uber/h3)? Then you should be able to get rid of one dimension.
-
In your case, since you actually care about 1) including both dimensions and 2) looking for discords (i.e., both `lat` and `lon` have to deviate and not just one), I think you want to do something like:

It's getting late here so hopefully I didn't misspeak in any of the above.