21-03-2011, 09:31 PM
Sir/Madam,
I am Sampath studying B.E.(ISE) at VCET,Mangalore. I want seminar report and ppt on the topic "E-MINE: A novel web mining approach".
Please send those to my mail id "sampathputtur[at]gmail.com".
Thank you.
21-03-2011, 09:31 PM
Sir/Madam, I am Sampath studying B.E.(ISE) at VCET,Mangalore. I want seminar report and ppt on the topic "E-MINE: A novel web mining approach". Please send those to my mail id "sampathputtur[at]gmail.com". Thank you.
07-04-2011, 04:37 PM
E-MINING.pptx (Size: 1.63 MB / Downloads: 189) E-MINING-A NOVEL WEB MINING APPROACH DEFINITION It is a technique that mines relevant data regions from a web page. THE PROPOSED TECHNIQUE E-Mine – An effective method to mine the data region from a web page automatically It enables the system to identify gaps that separate records, which helps to segment data records correctly. The visual information also contains information about the hierarchical structure of the tags. By observing a webpage, it can be analysed that the relevant data region occupies the major central part of the Webpage. SYSTEM OF THE e-Mine TECHNIQUE HOW ALGORITHM WORKS? Determining the height and width of all bounding rectangles. Identification of the largest rectangle. Identification of the container within the largest rectangle. Identification of data region containing data records with in the container. STEP 1 DETERMINING HEIGHT AND WIDTH OF ALL BOUNDING RECTANGLES Determine the dimensions of all the bounding rectangles in the web page. If not specified, the MSHTML parsing and rendering engine of Microsoft Internet Explorer 6.0 can be used. STEP 2 IDENTIFICATION OF THE LARGEST RECTANGLE Based on the height and width of bounding rectangles obtained in the previous step, we determine the area of the bounding rectangles Among these rectangles determine the largest rectangle. PROCEDURE FOR IDENTIFICATION OF LARGEST RECTANGLE Procedure getMaxRect Input: <body> of the HTML source for each child of <body> tag Begin Find the coordinates of the bounding rectangles for the child If the area of the bounding rectangle > area of maximum Rectangle then Maximum Rectangle = child Endif end STEP 3 Identification of the container with in the largest rectangle Once the largest rectangle is obtained, we determine the bounding rectangle having the largest area in the set. The reason for determining the largest rectangle within this set is that only the largest rectangle will contain data records. Procedure getContainer Input: The Largest Rectangle out of all Bounding Rectangles. List_of_Children=list of all the children tags associated with Maximum Rectangle. for each tag in List_of_Children begin if area of bounding rectangle of a tag > half the area of Maximum Rectangle then container = tag Endif End. IRRELEVANT PORTION TO BE FILTERED STEP 4 Identification of data region containing data records with in the container Filter is used to remove the irrelevant data from a container PROCEDURE FOR FILTER Input: The container obtained from the previous step. totalHeight=0 for each child tag within container totalHeight+=height of the bounding rectangle of child averageHeight = totalHeight/no of children of container averageHeight = totalHeight/no of children of container for each child within container if height of child’s bounding rectangle < averageHeight then Discard child from container endif end for end for ADVANTAGES Overcomes the disadvantages of the existing automated approaches. Eg: MDR Algorithm. It enables the system to identify gaps that separate records, which helps to segment data records correctly. The visual information also contains information about the hierarchical structure of the tags. DISADVANTAGES It may extract large amount of unwanted data The extracted relevant data region from a web page may not be of users interest CONCLUSION This is a new approach to extract structured data from web pages eMine is a pure visual structure oriented method that can correctly identify the data regions. eMine overcomes the drawbacks of existing methods and performs significantly better than existing methods.
14-09-2011, 12:38 AM
I think you mentioned wrong disadvantages because it only extracts center portion which contains only useful data but not unwanted data.
03-02-2012, 10:24 AM
to get information about the topic WEB MINING full report ,ppt and related topic refer the link bellow
https://seminarproject.net/Thread-web-mi...nar-report https://seminarproject.net/Thread-web-mining https://seminarproject.net/Thread-web-mining?page=2 https://seminarproject.net/Thread-web-mining?page=4 https://seminarproject.net/Thread-e-mine...g-approach https://seminarproject.net/Thread-open-w...web-mining https://seminarproject.net/Thread-webmin...nalization https://seminarproject.net/Thread-web-mining--24452 https://seminarproject.net/Thread-signed...t-outliers https://seminarproject.net/Thread-web-mining?pid=65307 https://seminarproject.net/Thread-freque...b-log-data
13-02-2012, 11:56 PM
send me this ppt
14-02-2012, 10:05 AM
to get information about the topic WEB MINING full report ,ppt and related topic refer the link bellow
https://seminarproject.net/Thread-web-mi...nar-report https://seminarproject.net/Thread-web-mining https://seminarproject.net/Thread-web-mining?page=2 https://seminarproject.net/Thread-web-mining?page=4 https://seminarproject.net/Thread-e-mine...g-approach https://seminarproject.net/Thread-open-w...web-mining https://seminarproject.net/Thread-webmin...nalization https://seminarproject.net/Thread-web-mining--24452 https://seminarproject.net/Thread-signed...t-outliers https://seminarproject.net/Thread-web-mining?pid=65307 https://seminarproject.net/Thread-freque...b-log-data
20-02-2012, 05:55 PM
[/size][/font][font=Times New Roman][size=medium]
13-03-2012, 04:57 PM
seminar report on e-mining for computer science
03-08-2012, 03:27 PM
E-MINE:A NOVEL WEB MINING APPROACH
E-MINING.pptx (Size: 482.42 KB / Downloads: 28) INTRODUCTION AND DEFINITION Now a days web is used widely as the medium of publication.Hence,a large collection of documents, images, text files and other forms of data in structured, semi structured and unstructured forms are available on web. Several attempts have been made to extract the regularly structured data from the web page. Existing automatic techniques are not satisfactory because of their poor accuracies. E-Mine – An effective method to mine the data region from a web page automatically. RELATED WORK MDR(Mining Data Records)is a technique mainly used in the area of data mining. It exploits the regularities in HTML tag structure directly. MDR algorithm makes use of all the HTML tag tree of the web page to extract data records from the page. THE PROPOSED TECHNIQUE Visual Information helps in three ways. It enables the system to identify gaps that separate records, which helps to segment data records correctly. The visual information also contains information about the hierarchical structure of the tags. By observing a webpage, it can be analysed that the relevant data region occupies the major central part of the Webpage. The E-Mine technique is based on three observations: A group of data records ,is typically presented in the neighboring region of the web page. The area covered by a rectangle that bounds the data region is more than the area covered by the rectangles bounding other regions. e.g., Advertisements and links. The height of an irrelevant data record within a collection of data records is less than the average height of the relevant data records within that data region. HOW THE ALGORITHM WORKS? Determining the height and width of all bounding rectangles. Identification of the largest rectangle. Identification of the container within the largest rectangle. Identification of data region containing data records within the container. CONCLUSION This is a new approach to extract structured data from web pages. eMine is a pure visual structure oriented method that can correctly identify the data regions. eMine overcomes the drawbacks of existing methods and performs significantly better than existing methods.
17-09-2012, 10:47 PM
plz send me ppt on e-mine:a novel web mining approach.
My id is shweturwt[at]gmial.com
16-01-2013, 09:56 AM
to get information about the topic "web mining" full report ppt and related topic refer the link bellow
https://seminarproject.net/Thread-web-mi...nar-report https://seminarproject.net/Thread-e-mine...g-approach https://seminarproject.net/Thread-web-mining-ppt https://seminarproject.net/Thread-mining...ons--41522 https://seminarproject.net/Thread-webmin...nalization |
|
Possibly Related Threads… | |||||
Thread | Author | Replies | Views | Last Post | |
hi guys i am new on web | 0 | 1,217 |
24-10-2019, 07:58 AM Last Post: |
||
the deep web seminar topic | Guest | 0 | 1,410 |
24-03-2019, 06:48 PM Last Post: Guest |
|
advantages and disadvantages embedded web technology | Guest | 0 | 757 |
20-02-2018, 11:32 PM Last Post: Guest |
|
edmonton web design seo | Guest | 1 | 880 |
18-01-2018, 02:41 PM Last Post: dhanabhagya |
|
phonet a voice based web technology ppt | Guest | 1 | 707 |
12-01-2018, 10:32 AM Last Post: dhanabhagya |
|
phonet a voice based web technology ppt | Guest | 1 | 708 |
12-01-2018, 10:24 AM Last Post: dhanabhagya |
|
edmonton web design seo | Guest | 1 | 1,150 |
06-01-2018, 04:18 PM Last Post: dhanabhagya |
|
web design siwes presenatation | Guest | 1 | 597 |
14-10-2017, 12:28 PM Last Post: jaseela123 |
|
web design siwes presenatation | Guest | 1 | 607 |
14-10-2017, 12:28 PM Last Post: jaseela123 |
|
A proxy based architecture for dynamic invocation of web services from mobile devices | Guest | 1 | 781 |
13-10-2017, 04:25 PM Last Post: jaseela123 |