Faculty

Lab Director
Research Interests: Big Data, Data Mining, Mobile Systems, Healthcare Systems, Fuzzy Logic
(Collaborator, Department of Environmental Engineering, IUT)
Research Interests: Environmental Impact Assessment, Air Pollution Modelling, Climate Change, HSE and Environmental Risk Managment
Prof. Jaleh Varshosaz
(Collaborator, Isfahan University of Medical Sciences)
Research Interests: Nanotechnology in Drug Delivery, Bioinformatics
(Collaborator, Carleton University, Ottawa, Canada)
Research Interests: biomedical informatics, bioinformatics, pattern classification, patient monitoring, proteomics

PHD Students

Research Interests: data fusion, spatial data, uncertainty
Start: Sep. 2012
(Supervisor Dr. Hossein Moradi, Advisor: Dr. Ghadiri)
Research Interests: Spatial Decision Support System, Environmental Impact Assessment of Road Networks
Start: Sep. 2013
Ensie Davoodijam
PhD student
Research Interests:
Start: Sept. 2015
Maryam Lotfi Shahreza
Research Interests: bioinformatics, drug repositioning, complex networks, big data
Start: Feb. 2012

MSc Students

Research Interests: Big Data
Start: Sep. 2014
Research Interests: Big Data, Data Mining, Bioinformatics
Start:
Research Interests: Big Data, Data Mining, Bioinformatics
Start: Sep. 2015
Research Interests: Data Mining , Big Data
Start: Sep. 2015
Mehdi Joodaki
Research Interests: big data, urban data mining
Start: Sep. 2016
Amir Hosein Atashkar
Research Interests: big data, urban data mining
Start: Sep. 2016

Alumni

  • Research Interests: Bioinformatics, Evolutionary Computation
    Start: Sep. 2013
    Finish: Jun. 2016
    Thesis Title: Constructing reliable interaction networks based on fuzzy inference and detecting protein complexes
    Thesis Abstract: Bioinformatics is a new scientific research area in which computers, specialized software, and databases are exploited to solve biological problems especially in molecular and cellular contexts.A working domain in bioinformatics is protein complex detection. Protein complexes are groups of proteins that collaborate to perform a particular function inside the living cell. In this filed, researchers have focused on designing algorithms to improve this grouping over the proteins. presented algorithms in this filed,identification protein complex from Graph made of the interactive network. A group of these methods A class of these methods on weighted graphs and other hands on the graphs weightless apply on weighted graphs and other hands apply on the unweighted graph. Surveys on interaction networks show that about 50 percent of detected interactions are false positives. Based on these studies, a general improvement for protein interaction networks could be building the network from multiple biological data sources to determine the weight of each interaction, and to obtain a more reliable network. In this research, several biological data sources are used to build the network. As every biological data source is prone to errors, by using different data sources and fusing them using fuzzy logic, we will be able to detect the errors and fade their effect on the detection of protein complexes. Moreover, we have proposed a general protein complex detection algorithm by exploiting the strong points of other algorithms and existing hypotheses regarding real complexes. Finally, the proposed method applied on Gavin, Krogan, Collin, DIP Data sets. Precision act better respectively 0.25, 0.23, 0.10, 0.27 and F-measure respectively 0.12, 0.14, 0.04, 0.18.
  • Research Interests: Data Mining, Big Data, Databases, Medical Text Mining, Machine Learning
    Start: Sep. 2014
    Finish: Jan. 2017
    Thesis Title: Single-document and multi-document concept-based biomedical text summarization
    Thesis Abstract: In recent decades with rapid increase in the volume of available textual information resources, automatic text summarization has become a useful tool to acquire and mange intended information. Using text summarization tools, clinicians and researchers in the biomedical domain can save their time and effort to manage numerous textual information resources. Various summarization methods have been developed so far using different approaches. Some available summarizers utilize term-based methods and generic criteria to measure the informativeness of sentences. Regarding the characteristics of biomedical text, it seems that there is a requirement to employ more efficient measures by biomedical summarizers. To address this issue, we propose a method that uses concept-level analysis of text in combination with itemset mining to identify the main subtopics of input text. In this method, the informativeness of each sentence is measured according to its meaning and the appearance of main subtopics in the sentence. Some biomedical summarizers use the frequency of concepts extracted from input text to select related sentences. To address challenges related to such methods, we propose another summarization method that utilizes concept-level analysis and Bayesian inference. This summarizer estimates the probability of selecting sentences for final summary by following the distribution of important concepts within input document. We performed extensive experiments to evaluate the performance of these two methods for single-document and multi-document summarization. The results of evaluations show that compared to the competitor methods, the two summarizers proposed in this thesis improve the performance of biomedical text summarization.
  • Somayeh Davari
    Research Interests:
    Start: Sep. 2012
    Finish: Jan. 2015
    Thesis Title: Implementation of Fuzzy Region Connection Calculus and its Application in Spatial Relationship of Diseases
    Thesis Abstract: Spatial data are important in today applications, so that every day we witness the extended use of them. Spatial data include the location and properties of spatial objects such as points, lines and regions. Spatial relations constitute an important form of human understanding of spatial formation. In this context, the relationships between spatial objects, especially topological relations between regions, have attracted considerable attention. However, real-world spatial regions such as lake or forest have no exact boundaries and are fuzzy. So it would be better to define the relationships between them in as fuzzy relations. Fuzzy topological relations have application in many contexts, including path tracking algorithms based on fuzzy relations, medical diagnosis of the patient's file, extracting topological relations from the Web, image interpretation, robot navigation and manipulation, brain MRI segmentation, soil science and many other contexts. So far, several researches are conducted on modeling fuzzy spatial topological relations, and progresses have been made. Some methods for modeling fuzzy spatial regions and fuzzy relationships between them have been proposed. However, given the huge amount of data stored in spatial databases, and the fact that existing spatial database systems are based on non-fuzzy relations, we require data processing methods that are based on fuzzy spatial relations. Therefore, the fuzzy enrichment of relations in spatial databases can improve data processing techniques and decision making based on them, as well as improving the user interface in comparison to most today systems. In this thesis, a novel method is proposed for implementing fuzzy relations in spatial databases that is applicable to many applications. As an important application, the relationships used to analyse the spatial relationship between diseases will be evaluated. Additionally, a method based on fuzzy RCC relations for fuzzification of an important group of spatial queries namely the skyline operator is proposed that can be used in decision support, data visualization and spatial databases applications. The proposed algorithms have been implemented and evaluated on real-world spatial datasets. The results of the evaluation of these algorithms show that more flexibility in comparison with existing methods, and speed and quality of the results are appropriate.
  • Amir Hossein Goudarzi
    Research Interests: Spatial data, open source software
    Start: Sep. 2012
    Finish: Sep. 2014

    Last news: PhD Student at Iran University of Science and Technology, Tehran, Iran

    Thesis Title: Classification of spatial data in order to manage the development of urban regions using MOSES algorithm
    Thesis Abstract: Spatial data is one of the most important and sensitive elements of social, economic and political decision making in life today. This is why many needs, goals and different organizational activities are dependent to the knowledge earned from spatial data which is especially important to strategic planning. Researches done in this field are usually neglecting the deep knowledge mined from geographical databases and are based on pure statistical methods. So, classification of urban regions may represents a comprehensive basis for land use and finally makes the decision making based on the deep knowledge mined from geographical databases. Due to the huge volume of data gathered in spatial databases, mining association rules and high level knowledge representation matters a lot. Specifically and in context of maps and spatial data, many spatial data mining algorithms have been proposed. Despite, there are few algorithms who can manage geographical and non-geographical data using topological means, many decision making problems like developing urban areas require such perception and reasoning. For, in this thesis an approach based on genetic programming, statistical modelling and knowledge representation, is represented. To apply MOSES mining rules considering fuzzy topological relations from spatial data, a hybrid architecture called GGEO benefiting from fuzzy region connection calculus is proposed and implemented. Overcoming the problem of time consuming topological relationships calculations, this method is based on data preprocessing. GGEO analyzes and learns from geographical and normal data simultaneously and uses topological distance parameters, representing a series of arithmetic-spatial formulas as classification rules. This approach is resistant to noisy data. Also all its stages run in parallel increasing speed. This approach may be used in different spatial data classification problems as well as representing an appropriate method of data analysis and economic policy making. To represent the application of the mined knowledge in decision making problems in urban planning domain, the method’s been used in a highway planning problem with limited funds.
  • Research Interests: Data Mining, Machine Learning, Big Data, Privacy and Security
    Start: Sep. 2012
    Finish: Jul. 2014

    Last news: PhD Student at Florida State University.

    Thesis Title: Security in location-based social networks
    Thesis Abstract: By the technology developments and especially advent of smart phones with location sensors, location-based services become more popular everyday. The main attribute of these services is using user’s location information and supplying response based on location information. These services are so attractive because they are correlated to user’s real life, but they can also be threatening in case of misuse, thus privacy and security is an important issue in these services. There are many proposed methods to achieve this goal and the most important aspect of all methods is that the information send and reveal to whom and as much as needed for the service. Spatial and temporal cloaking, adding noise and trusted anonymizer server are some of most known methods, but none of them achieve this goal completely and they have some disadvantages either. In this thesis after a survey on previous methods, we propose a novel method based on fuzzy clustering and Bayesian theorem to improve clustering precision and it has been shown that this algorithm could deal with noisy datasets and clustered records with low certainty and thus it helps us to users location information more accurately even with added noise and we could extract users common seen places and thus determine user and his pattern. By this method we show that previous methods could not be trusted and especially if an invader has access to trusted server, he could achieve so many information about users even if they use pseudonym and thus trusted server is untrustable either. Then we propose a method based on separating queries from each others and users and we use ticket instead of ID for users. In this method we use cryptographic and hashing table to distribute queries and responses irregularly and thus the correlation between continuous queries will be eliminated and thus it is impossible for the intruder to determine users based on a sequence of queries. Proposed methods has been tested on some datasets such as MSR location dataset and experimental results show significant improvements in both methods.
  • Ilnaz Khodadadi
    (Supervisor: Dr. Seyed Rasoul Mousavi, Advisor: Dr. Ghadiri)
    Research Interests:
    Start: Sep. 2011
    Finish: Dec. 2013
    Thesis Title: Computational methods for haplotype reconstruction
    Thesis Abstract: Human genomes vary from each other in certain positions. Single nucleotide polymorphism (SNP), is the most common type of these variations. SNPs are of importance in drug design and medical diagnostic applications.And they also offer highest resolution for tracking disease genes. In diploid organisms such as human, the chromosomes come in two copies (one inherited from the mother and one from the father). The SNP sequence on each copy of chromosome is called haplotype. Current sequencing technologies can only provide fragments in at most several thousand base pairs and they cannot tell which copy of chromosome the fragments belong to. Hence computational methods are used to rebuild two haplotypes from these fragments. Sequencing technologies provide quality value Q, which is an integer mapping of q (the probability that the corresponding base call is incorrect). The current state-of-the-art single individual haplotyping algorithm uses Max-SAT. In this research, we have proposed a novel method that uses weighted Max-SAT and WMLF model instead of Max-SAT and MEC model, with purpose of using quality values to make results more accurate. There are several models for single individual haplotyping problem. In this research, is shown that a probability model and the MEC model are equivalent under reasonable approximations. Although there are some criticisms on the MEC model, this shows its rationality. We also have proposed a novel metric "Weighted Reconstruction Rate". To evaluate proposed algorithm, we compared it with two other methods using MEC and proposed metric. The results of this comparison on real data correspond to NA12878 show slight improvements. It should be noted that the accuracy of methods are more than 90%.
  • Bagher Saberi
    Research Interests: Spatial Data, Data Quality
    Start: Sep. 2011
    Finish: Jan. 2014

    Last news: Network Admin at Iran Blood Transfusion Organization

    Thesis Title: Spatial Data Quality Assessment: A Sample-Based Approach
    Thesis Abstract: Spatial data is playing an emerging role in new technologies such as web and mobile mapping and Geographic Information Systems (GIS). Important decisions in political, social and many other aspects of modern human life are being made using location data. Decision makers in many countries are exploiting spatial databases for collecting information, analyzing them and planning for the future. In fact, not every spatial database is suitable for this type of application. Inaccuracy, imprecision and other deficiencies are present in location data just as any other type of data and may have a negative impact on credibility of any action taken based on unrefined information. So we need a method for evaluating the quality of spatial data and separating usable data from misleading data which leads to weak decisions. On the other hand, spatial databases are usually huge in size and therefore working with this type of data has a negative impact on efficiency. To improve the efficiency, we need a method for shrinking the volume of data. Sampling is one of these methods, but its negative effects on the quality of data are inevitable. In this paper we are trying to show and assess this change in quality of spatial data that is a consequence of sampling. We used this approach for evaluating the quality of sampled spatial data related to mobile user trajectories in China which are available in a well-known spatial database. The results show that sample-based control of data quality will increase the query performance significantly, without losing too much accuracy. Based on this results some future improvements are pointed out which will help to process location-based queries faster than before and to make more accurate location-based decisions in limited times.
  • Amin Beiranvand
    Research Interests: Linked Data, Federated Queries
    Start: Sep. 2011
    Finish: Jan. 2014

    Latest news: Java Developer at Informatics Services Corporation

    Thesis Title: Adaptive Processing of Federated Queries over Linked Data based on Tuple Routing
    Thesis Abstract: Recent achievements of linked data implementations and increased number of datasets available on the web as linked data, has given rise to the need and tendency toward processing federated queries over these datasets. Due to distribution of linked data across the web, the methods that process federated queries through a distributed approach are more attractive to users and have gained more prosperity. In distributed processing of federated queries, we need methods and procedures to execute the query in an optimal manner. Most existing methods perform the optimization task based on statistical information, whereas the query processor does not have precise statistical information about their properties, since the data sources are autonomous. When precise statistics are not available, the possibility of wrong estimation highly increases and may lead to inefficient execution of query at runtime. Another problem of existing methods is that in optimization phase, they assume that runtime conditions of query execution are stable, while the environment in which federated queries are executed over linked data is dynamic and non-predictable. By considering these two problems, there is a great potential for exploiting query processing techniques in an adaptive manner. In this paper, an adaptive method is proposed for processing federated queries over linked data which is based on the concept of routing the tuples. The proposed method is able to execute the query effectively without any need to prior statistical information. This method can change the query execution plan at runtime so that less intermediate results are produced, and it can also adapt the execution plan to new situation if unpredicted network latencies arise. Evaluation of our method by running real queries over well-known linked datasets shows very good results especially for complex queries.

BSc. Students (Alumni)

  • Research Interests: Supercomputing, Parallel Processing, Machine Learning, Big Data
    Start: Sep. 2011
    Finish: Sep. 2016

    Last news: PhD Student at Florida State University.

  • Research Interests: Cloud Computing
    Start: Sep. 2011
    Finish: Sept. 2015

    Latest news: Javascript Developer at Mahan Airlines

  • Research Interests: Cloud Computing, Networking, Database, Artificial Intelligence
    Start: Sep. 2011
    Finish: Sept. 2015

    Last news: Masters student at Iran University of Science and Technology

  • Mahbod Milanizadeh
    Research Interests: Android Software Architecture
    Start: Sept. 2011
    Finish: Sept. 2015
  • Elaheh Ebrahimi
    Research Interests:
    Start: Sep. 2011
    Finish:
  • Fahime Berenjkoub
    Research Interests:
    Start:
    Finish:
  • Research Interests: Big Data, Data Mining, Machine Learning, Parallel Processing
    Start: Sep. 2012
    Finish: Sep. 2016
  • Sepehr Bayat
    Research Interests: Machine Learning, Speech Processing
    Start: Sep. 2012
    Finish: Sep. 2016

    Latest news: MSc student at Carleton University

  • Research Interests: Social computing, Information Visualization, Machine Learning, Human-Computer Interaction, Complex Networks
    Start: Sep. 2012
    Finish: Jul. 2016

    Latest news: MSc student at  Sharif University of Technology

  • A Calendar Application on Android
    Research Interests:
    Start: Sept. 2010
    Finish: Sept. 2014

    Last news: MSc Student at Sharif University of Technology