eeda6d3e768657e1063e5c6c50c79d93.ppt
- Количество слайдов: 28
Mining Mobile Group Patterns: A Trajectory-based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu NSYSU, Taiwan Ee-Peng Lim NTU, Singapore
Outline n n n Introduction Problem Definition Algorithms Evaluation Conclusions
Introduction n Many ways can determine the groups an object belongs to. n n n Grouping based on demographics Grouping based on purchasing behavior Groups formed by using spatial-temporal information are useful. n n Objects within a mobile group tend to closely influence one another. Potential applications: n n Construction of social network Animal behavior study group-based pricing models or marketing strategies Previous work gives a precise definition about mobile groups and derives algorithms for efficiently identifying mobile groups. n n Physical proximity between group members. Temporal proximity between group members.
Original Problem Definition
Problem Definition
Problem Definition n Definition 1. Given a group of users G, a maximum distance threshold max_dis, and a minimal time duration threshold min_dur, a set of consecutive time points [t, t+k] is called a valid segment of G if 1. All users in G are not more than max_dis apart at time t, t+1, …, and t+k; 2. Some users in G are more than max_dis apart at time t-1: 3. Some users in G are more than max_dis apart at time t+(k+1); 4. (k+1)>=min_dur
Problem Definition max_dis=10, min_dur=3;
Problem Definition n Definition 2. Let P be a mobile group with valid segments s 1, …, sn, and N denotes the number of time points in the database, the weight of P is defined as:
Problem Definition n If the weight of a mobile group exceeds a threshold min_wei, we call it a valid mobile group. For example, if max_dist=10, min_dur=3, min_wei =50%, the mobile group {u 2, u 3, u 4} is a valid mobile group, since it has valid segments{[1, 3][6, 8]} and weight 6/10>0. 5. The mobile group mining problem: Given D, max_dis, min_dur, and min_wei, find all valid mobile groups.
Pitfalls of the location model n n n To maintain accurate location tracking, the frequency of sampling users’ locations must be high. (Tracking 1000 users every second will result in 1 GB per day) In reality, moving objects may be disconnected from time to time voluntarily or involuntarily. It is almost impossible to have perfectly synchronized sampling of users’ locations in reality.
Remedies n n n Use trajectories with untraceable periods to model user locations The mobile group mining problem has to be redefined. The algorithms have to be modified.
Trajectory model n A trajectory T is a set of piecewise linear functions, each of which maps from a disjoint time interval to an ndimensional space. n E. g.
Trajectory-based location DB reference_ point (3, 1) 0 3 (7, -11) (1, 5) 3 5 (4, 3) 6 9 (2, 2) (2, 1) 0 3 (2, -13) (2, 6) 3 5 (-4, 5) o 3 end_ti me (10, -3) o 2 start_t im e (1, 1) o 1 veloc it y (3, 2) 6 10 (2, 4) (3, 1) 0 3 (17. -5) (-2, 4) 3 5 (12, 35) (-1, 4 5 8
How to convert location data into trajectories n The change point detection problem n Recursive linear regression
How to convert location data into trajectories n Dead reckoning
Determining the distance of 2 objects n For trajectories of two objects o 1 and o 2 n n Synchronize linear pieces Calculate the distance for each time segment Object o 1: Object o 2:
Determining the distance of 2 objects n n n Location of object o 1 at time t: (1 + 3 t , 1 + t) Location of object o 2 at time t: (2 + 2 t, 2 + t) Enclidean distance of o 1 and o 2 when 0 t<3:
Determining close intervals n n Given a distance function dist(t) of two objects o 1 and o 2 within an interval I, we would like to identify the subintervals I’ in I such that dist(t) max_dis, t I’. E. g. n Let 3=max_dis= n [ ] [0, 3)= [0, 3)
Definitions n For a user group P n n n Geographically close, far, or undecided at a time point t. The valid close segments and valid far segments of P can be accordingly defined. The weight of P is defined as
The problem n The problem is to find all valid mobile groups under such a model n Apriori property still holds n if a moble group is valid, all of its subgroup will also be valid.
Apriori Trajectory-based Group Pattern Mining
Trajectory VG-Growth n n The set of valid 2 -groups form a graph called VG-graph The close and far segments of a conditional TVG graph have to be properly updated. n n c(o 1 , o 2 | o 3) = c(o 1 , o 2)∩c(o 1 , o 3)∩c(o 2 , o 3) f(o 1 , o 2 | o 3) = f(o 1 , o 2)∪f(o 1 , o 3)∪f(o 2 , o 3)
An example
Performance evaluation DBI: 100 M 100 N DBII: 100 M 500 N DBIII: 100 M 1000 N
Performance evaluation n We compare the other two methods for handling untraceable intervals for objects n n n Pessimistic Linear Performance metrics
Performance Evaluation
Conclusions n n We have defined the mobile group mining problem on a new location model and proposed algorithms. Future work n n n Correcting location measurement error Calendar-based mobile group mining Applications using mobile group patterns
eeda6d3e768657e1063e5c6c50c79d93.ppt