written 8.8 years ago by |
Data Warehousing, Mining and Business Intelligence - Dec 2012
Information Technology (Semester 7)
TOTAL MARKS: 100
TOTAL TIME: 3 HOURS
(1) Question 1 is compulsory.
(2) Attempt any four from the remaining questions.
(3) Assume data wherever required.
(4) Figures to the right indicate full marks.
Solve any four:-
1(a) What is noisy data. How to handle it?(5 marks)
1(b) What is Market Segmentation?(5 marks)
1(c) Explain fact less fact table with suitable example.(5 marks)
1(d) How FP tree is better than Apriori Algorithm.(5 marks)
1(e) Differentiate between Periodic Crawler and Incremental Crawler.(5 marks)
2(a) Explain multidimensional association rules with suitable example.(10 marks)
2(b) Explain spatial data cube construction and spatial OLAP with example.(10 marks)
3(a) Explain Hoeffding Tree algorithm with example.(10 marks)
3(b) What is web mining? Explain web content mining with reference to personalization, harvest system.(10 marks)
4(a) What is clustering? Explain requirements and applications in detail.(10 marks)
4(b) Explain Agglomerative clustering with an example.(10 marks)
5(a) Write difference between OLTP and OLAP explain different OLAP operations.(10 marks)
5(b) Explain Regression? Explain Linear Regression with example.(10 marks)
6(a) Explain HITS Algorithm in Web mining.(10 marks)
6(b) A database has four transactions. Let minimum support and confidence is 50%
(10 marks)
Write short notes on any two :-
7(a) Issues in classification and explain any one technique of classification.(10 marks) 7(b) Sequence mining in transactional database.(10 marks) 7(c) Text mining approaches.(10 marks) 7(d) Fraud detection.(10 marks)