Documents created and distributed on the Internet are ever changing in various forms. Most of existing works are devoted to topic modeling and the evolution of individual topics, while sequential relations of topics in successive documents published by a specific user are ignored. In order to characterize and detect personalized and abnormal behaviors of Internet users, we propose Sequential Topic Patterns (STPs) and formulate the problem of mining Useraware Rare Sequential Topic Patterns (URSTPs) in document streams on the Internet. They are rare on the whole but relatively frequent for specific users, so can be applied in many real-life scenarios, such as real-time monitoring on abnormal user behaviors. Here present solutions to solve this innovative mining problem through three phases: pre-processing to extract probabilistic topics and identify sessions for different users, generating all the STP candidates with (expected) support values for each user by patterngrowth, and selecting URSTPs by making user-aware rarity analysis on derived STPs. Experiments on both real (Twitter) and synthetic datasets show that our approach can indeed discover special users and interpretable URSTPs effectively and efficiently, which significantly reflect users’ characteristics.
                                
                                
                                    
                                    Web mining, sequential patterns, document streams, rare events, pattern-growth, dynamic programming
                                
                                
                                
                                
                                    
                                        
                                        
                                        
                                        
                                            
                                                
                                                    International Journal of Trend in Scientific Research and Development - IJTSRD having
                                                    online ISSN 2456-6470. IJTSRD is a leading Open Access, Peer-Reviewed International
                                                    Journal which provides rapid publication of your research articles and aims to promote
                                                    the theory and practice along with knowledge sharing between researchers, developers,
                                                    engineers, students, and practitioners working in and around the world in many areas
                                                    like Sciences, Technology, Innovation, Engineering, Agriculture, Management and
                                                    many more and it is recommended by all Universities, review articles and short communications
                                                    in all subjects. IJTSRD running an International Journal who are proving quality
                                                    publication of peer reviewed and refereed international journals from diverse fields
                                                    that emphasizes new research, development and their applications. IJTSRD provides
                                                    an online access to exchange your research work, technical notes & surveying results
                                                    among professionals throughout the world in e-journals. IJTSRD is a fastest growing
                                                    and dynamic professional organization. The aim of this organization is to provide
                                                    access not only to world class research resources, but through its professionals
                                                    aim to bring in a significant transformation in the real of open access journals
                                                    and online publishing.