Loading...
Search for: data-mining
0.008 seconds
Total 208 records

    Parallel polynomial root extraction on a ring of processors

    , Article 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005, Denver, CO, 4 April 2005 through 8 April 2005 ; Volume 2005 , 2005 ; 0769523129 (ISBN); 0769523129 (ISBN); 9780769523125 (ISBN) Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    In this paper, a parallel algorithm for computing the roots of a given polynomial of degree n on a ring of processors is proposed. The algorithm implements Durand-Kerner's method and consists of two phases: initialization, and iteration. In the initialization phase all the necessary preparation steps are realized to start the parallel computation. It includes register initialization and initial approximation of roots requiring 3n-2 communications, 2 exponentiation, one multiplications, 6 divisions, and 4n-3 additions. In the iteration phase, these initial approximated roots are corrected repeatedly and converge to their accurate values. The iteration phase is composed of some iteration... 

    Using Data Mining In Supply Chain Management (SCM)

    , M.Sc. Thesis Sharif University of Technology Karimi, Hossein (Author) ; Salmasi, Naser (Supervisor)
    Abstract
    Nowadays, finding suppliers of goods and materials is easier than the past, because of developments in communications technology. Therefore, some of supply chains have expanded their relations to global area and select international partners. With an increase in the scope of suppliers available, supplier management sections encounter a large number of suppliers to choose from, and loads of information to process. Therefore proper methods must be adopted in order to evaluate the suppliers. One way is to create credit levels and ranking the suppliers according to past cooperation. Implementation of this method requires the use of tools that provide Comprehensive analysis of the occult rules... 

    Using a classifier pool in accuracy based tracking of recurring concepts in data stream classification

    , Article Evolving Systems ; Volume 4, Issue 1 , 2013 , Pages 43-60 ; 18686478 (ISSN) Hosseini, M. J ; Ahmadi, Z ; Beigy, H ; Sharif University of Technology
    2013
    Abstract
    Data streams have some unique properties which make them applicable in precise modeling of many real data mining applications. The most challenging property of data streams is the occurrence of "concept drift". Recurring concepts is a type of concept drift which can be seen in most of real world problems. Detecting recurring concepts makes it possible to exploit previous knowledge obtained in the learning process. This leads to quick adaptation of the learner whenever a concept reappears. In this paper, we propose a learning algorithm called Pool and Accuracy based Stream Classification with some variations, which takes the advantage of maintaining a pool of classifiers to track recurring... 

    A novel SVM approach of islanding detection in micro grid

    , Article 2012 IEEE Innovative Smart Grid Technologies - Asia, ISGT Asia 2012, 21 May 2012 through 24 May 2012 ; May , 2012 ; 9781467312219 (ISBN) Bitaraf, H ; Sheikholeslamzadeh, M ; Ranjbar, A. M ; Mozafari, B ; Sharif University of Technology
    2012
    Abstract
    Passive islanding detection schemes are more used in utilities due to their low costs and less harmonic problems although having larger Non Detection Zones (NDZ) relative to other schemes. Passive system measurements at the point of common coupling (PCC) are the basis of this scheme. A new approach in passive techniques is the use of intelligent based methods in data-mining to classify the system parameters which affect the islanding detection. As a result, by finding an efficient and robust data-mining algorithm, the passive schemes problem, which is their relatively large NDZ, will be minimized. In this paper, massive measurements are collected by simulation of IEEE standard distribution... 

    Semi-supervised ensemble learning of data streams in the presence of concept drift

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) ; Volume 7209 LNAI, Issue PART 2 , 2012 , Pages 526-537 ; 03029743 (ISSN) ; 9783642289309 (ISBN) Ahmadi, Z ; Beigy, H ; Sharif University of Technology
    Abstract
    Increasing access to very large and non-stationary datasets in many real problems has made the classical data mining algorithms impractical and made it necessary to design new online classification algorithms. Online learning of data streams has some important features, such as sequential access to the data, limitation on time and space complexity and the occurrence of concept drift. The infinite nature of data streams makes it hard to label all observed instances. It seems that using the semi-supervised approaches have much more compatibility with the problem. So in this paper we present a new semi-supervised ensemble learning algorithm for data streams. This algorithm uses the majority... 

    Freshness of web search engines: Improving performance of web search engines using data mining techniques

    , Article International Conference for Internet Technology and Secured Transactions, ICITST 2009, 9 November 2009 through 12 November 2009, London ; 2009 ; 9781424456482 (ISBN) Kharazmi, S ; Farahmand Nejad, A ; Abolhassani, H ; Sharif University of Technology
    Abstract
    Progressive use of Web based information retrieval systems such as general purpose search engines and dynamic nature of the Web make it necessary to continually maintain Web based information retrieval systems. Crawlers facilitate this process by following hyperlinks in Web pages to automatically download new and updated Web pages. Freshness (recency) is one of the important maintaining factors of Web search engine crawlers that takes weeks to months. Many large Web crawlers start from seed pages, fetch every links from them, and continually repeat this process without any policies that help them to better crawling and improving performance of those. We believe that data mining techniques... 

    Evaluation and improvement of energy consumption prediction models using principal component analysis based feature reduction

    , Article Journal of Cleaner Production ; Volume 279 , 2021 ; 09596526 (ISSN) Parhizkar, T ; Rafieipour, E ; Parhizkar, A ; Sharif University of Technology
    Elsevier Ltd  2021
    Abstract
    The building sector is a major source of energy consumption and greenhouse gas emissions in urban regions. Several studies have explored energy consumption prediction, and the value of the knowledge extracted is directly related to the quality of the data used. The massive growth in the scale of data affects data quality and poses a challenge to traditional data mining methods, as these methods have difficulties coping with such large amounts of data. Expanded algorithms need to be utilized to improve prediction performance considering the ever-increasing large data sets. In this paper, a preprocessing method to remove noisy features is coupled with predication methods to improve the... 

    Event classification from the Urdu language text on social media

    , Article PeerJ Computer Science ; Volume 7 , 2021 ; 23765992 (ISSN) Awan, M. D. A ; Kajla, N. I ; Firdous, A ; Husnain, M ; Missen, M. M. S ; Sharif University of Technology
    PeerJ Inc  2021
    Abstract
    The real-time availability of the Internet has engaged millions of users around the world. The usage of regional languages is being preferred for effective and ease of communication that is causing multilingual data on social networks and news channels. People share ideas, opinions, and events that are happening globally i.e., sports, inflation, protest, explosion, and sexual assault, etc. in regional (local) languages on social media. Extraction and classification of events from multilingual data have become bottlenecks because of resource lacking. In this research paper, we presented the event classification task for the Urdu language text existing on social media and the news channels by... 

    Combining Market Segmentation and Customer Segmentation Using Three-factor Theory: the Telecom (MCI) Case Study

    , M.Sc. Thesis Sharif University of Technology Nabizadeh, Mohammad (Author) ; Najmi, Manochehr (Supervisor)
    Abstract
    Mobile and wireless services industry has been among the most attractive businesses recently and is progressing at a fast pace. In this huge industry various organizations and entities cooperate to deliver value to the end user. The mobile operator plays the pivotal role in this chain. Hence mobile operator’s market was chosen in this study as the representative of the industry. Market segmentation and customer satisfaction are the main topics of this study.
    Market segmentation for MCI was done using data mining with two-step algorithm. Input data included the consumer behavior extracted from mobile post-paid bills. The results indicated five distinctive clusters. The next part included... 

    Applying Ant Colony Optimization for Solving Facility Layout Problem with Unequal Area and Flexible Bay Structure

    , M.Sc. Thesis Sharif University of Technology Famil Farnia, Farid (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    In this thesis a model of mixed integer programming to find the optimal solution of bi-objective facility layout problem in flexible bay structure according to uncertainty in flow material among departments and closeness rating is represented. In a facility layout problem based on flexible bay structure, departments with unequal areas are allocated to parallel bays. Also each department can only be allocated in one bay. The Goals are to minimize material handling cost and to maximize closeness rating. Uncertainty in flow material and closeness rating are modeled by fuzzy numbers. Due to the high complexity of the presented model, exact methods are only able to respond to maximize of 9... 

    Application of Data Mining in Healtcare

    , M.Sc. Thesis Sharif University of Technology Oliyaei, Azadeh (Author) ; Salmasi, Nasser (Supervisor)
    Abstract
    Data mining is the one of top ten developing knowledge in the world. This study followed three fold objectives; Firstly, An efficient model based on data mining algorithms is proposed to predict the duration of hospitalization time for patients of digestive system disease that need short term care. Duration of hospitalization is an important criterion to be used for predicting the hospital resources. In order to, a combined model based on CHAID and C.5 decision trees and a neural network is suggested. The suggested model predict the duration of hospitalization with 82% accuracy. The second object of this study is to propose an algorithm based on likelihood ratio. The suggested algorithm... 

    Personalized Recommender System based on Taxi Driver’s Behavior to Optimize Supply and Demand

    , M.Sc. Thesis Sharif University of Technology Pouyabahar, Ardalan (Author) ; Heydarnoori, Abbas (Supervisor) ; Habibi, Jafar (Co-Advisor)
    Abstract
    Taxis provide a flexible and indispensable service to satisfy the urban travel demand of public commuters. Balancing supply and demand and minimizing the driving time before finding a customer would directly increase taxi drivers and system income. Availability of GPS traces and easy access to the Internet has enabled taxi companies to be aware of supply-demand level in every region and all taxi states in real time. In this research, we propose a novel fair recommender system to maximize the sum of taxi drivers’ income by recommending regions with the highest profitability due to each driver’s acceptance rate and each regions’ minimum supply availability. Experiment results on TAP30’s data... 

    Application of Data Mining in Prediction of Diabetes type 2

    , M.Sc. Thesis Sharif University of Technology Bagherzadeh Khiabani, Farideh (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    Developments in the field of data storage which is due to computers have led to an extraordinary increase in medical data just like the increase in all other fields. As a result, physicians are faced with the problem of using the stored data. Therefore, the traditional manual data analysis is inadequate due to the large amounts of data. Furthermore, the ability to use this data to extract useful information is critical for the quality of medical care. Therefore, data mining techniques arose so that we will be able to extract knowledge through applying them to the raw data and subsequently help the doctors in making decisions.
    In this study, we are pursuing four goals. First, in order to... 

    Automatic Author Age Identification Using Social Media Texts

    , M.Sc. Thesis Sharif University of Technology Askari, Maryam (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    The most common form of communication on the internet and social network websites is text messages. normally communication on social media or even on the web is by posting some sort of text. usually, these messages or posts are short and text used in them may not follow any language standards, this makes it very difficult to process them. Different age groups use a certain language differently and this is shown in the way, each of them writes texts. The advancements made in the field of natural language processing and computational linguistics makes it possible to predict, text authors age groups by analyzing the way they write. This study focuses on ways to automatically recognize the age... 

    An Online Portfolio Selection Algorithm Using Pattern-matching Principle

    , M.Sc. Thesis Sharif University of Technology Azin, Pejman (Author) ; Khedmati, Majid (Supervisor)
    Abstract
    According to the rise of turnover and pace of trading, accelerating of analysis and making decision is unavoidable. Humans are unable to analyze big data quickly without behavioral biases so, using machines to analyze big data seems critical. Hence, financial markets tend to apply algorithmic trading in which some techniques like data mining and machine learning are notable. OLPS which sequentially allocates capital among a set of assets aiming to maximize the final return of investment in the long run, is the core problem in algorithmic trading. This article presents an online portfolio selection algorithm. The online portfolio selection sequentially selects a portfolio over a set of assets... 

    Distributed Data Mining in Peer-to-Peer Systems

    , Ph.D. Dissertation Sharif University of Technology Mashayekhi, Hoda (Author) ; Habibi, Jafar (Supervisor)
    Abstract
    Peer-to-peer (P2P) computing is a popular distributed computing paradigm for many applications which in-volve exchange of information among a large number of peers. In such applications, large amount of data is distributed among multiple dispersed sources. Therefore, data analysis is challenging due to processing, storage and transmission costs. Moreover, the data rarely remains static and frequent data changes, quickly out date previously extracted data mining models. Distributed data mining deals with the problem of data analysis in environments with distributed data and computing resources. In this dissertation, we explore distributed data mining in different structures of P2P systems. In... 

    Model Selection for Social Network Simulation in a Decision Support System

    , Ph.D. Dissertation Sharif University of Technology Aliakbary, Sadegh (Author) ; Habibi, Jafar (Supervisor) ; Movaghar Rahimabadi, Ali ($item.subfieldsMap.e)
    Abstract
    A social network represents a set of entities and their relationships. Telecommunication networks, online social networks, and paper citation networks are some examples of networks in real world. Nowadays, analysis of social networks is an interesting research area with important applications. Particularly, managers of the social networks and the decision makers often require intelligent decision support for futures study in these social systems. The demanded decision support systems make it possible to define the desired social problem and to analyze the ”what-if scenarios.“ Computer simulation is an appropriate approach toward such decision support systems. In this approach, the desired... 

    An Improved Clustering Method of Data Mining in Healthcare and Its Implementation

    , M.Sc. Thesis Sharif University of Technology Shourabizadeh, Hamed (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    In this study, a brief definition of data mining and its variants were mentioned. Then the methods and algorithms for clustering and their application in the field of healthcare is studied. Concidering the available data for anemia disease, including numeric and categorical attributes, the k-medoids clustering algorithm was selected. This algorithm is one of the simple, powerful and most widely used methods for clustering. The drawbacks of this algorithm are as follow: requires a user input on the number of clusters, depends on the initial data and traps in the local optima. In this thesis, an improved method of clustering-based on Random Forest and k-medoids algorithms has been developed.... 

    Identifying and Predicting Tumor and MS Disease Through MRI Data of Patients by Data Mining Tools

    , M.Sc. Thesis Sharif University of Technology Moazeni, Mehran (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    Today with the development of technology in medical science, there is a need to develop new methods to analyze and process the medical images. Furthermore, increasing use of machines and computers to accomplish prediction goals delineates that these tools had promising results. Because of all the above, this research focuses on processing and analyzing medical images with using data mining tools in order to identify MS and tumor disease which have been ubiquitous in last decades, fast and meticulous. To do so, we introduce a new clustering algorithm based on the modularity measure of graph networks as well as a new machine learning algorithm based on Kalman filter for Tensor-based data.... 

    Web Anomaly Host Based IDS, a Machine Learning Approach

    , M.Sc. Thesis Sharif University of Technology Khalkhali, Iman (Author) ; Azmi, Reza (Supervisor) ; Khansari, Mohammad (Co-Advisor)
    Abstract
    Web servers and web applications are susceptible to different attacks. In order to detect web-based attacks Intrusion detection systems (IDS) should be equipped with a large number of signatures. Unfortunately various types of web threats are increasingly growing and so detection and prevention of all these new and old attacks is exhaustive and really difficult.This thesis represents a designed system for intrusion detection that uses different techniques to discover vulnerabilities with derived patterns and also some user behavior based attacks against web applications. This was done by using new dataset which was generated by new log file.The primary objective of this thesis shows the...