The newest versions of our programs now feature Data Sensing technology. This clever technology adds artificial intelligence abilities to WebToData.
WebToData can analyze the contents of a web page and sense the data in the page. Data is then presented in a delimited format ready to use in your favorite database. The program does this all automatically!
To gain the benefits of this exciting new technology all you have to do is order a registered copy of WebToData.
WebToData now have the ability to process multiple files in one session. The programs can process many files and create a corresponding file for each processed file or it can process many files and create one file containing the results of the selected files.
For database creators this means WebToData now has the ability to process many html files and create one database file. This feature is only available by purchasing the registered version.
This is a new clever technology developed by GLTSoft.
The design of most web pages is based on HTML tables. Tables are used to structure the layout of pages and to present information to web users. This creates a difficulty for any program that attempts to extract data from web pages - how to find worthwhile data? Our research has found that expensive programs that claim to have the ability to work with data found in web pages are not successful.
GLTSoft have developed an Artificial Intelligence engine that is integrated into WebToData. This engine analyses the data found in web pages, extracts it and then presents the data ready to be saved to file in a database format.
For example the following table would be found by WebToData and analyzed by the Data Sensing engine.
|Bob Smith||12 Wood St.||Miles|
|Jan Albert||44 Station Rd||Oakton|
|Dianne Wong||33 Range Rd||Clifton|
|Jack Dinnis||19 Cutting Rd||Temby|
Once analyzed, the data would be presented in the following format ready to save.
Name, Street, Town,
Bob Smith, 12 Wood St., Miles
Jan Albert, 44 Station Rd, Oakton
Dianne Wong, 33 Range Rd, Clifton
Jack Dinnis, 19 Cutting Rd, Temby
This new innovation is called data sensing technology. But it is far more than working with simple tables, such as the one presented above, and extracting data from them. This technology is designed to work with highly complex web pages typically containing multiple tables. Many of these are used to structure the layout of the page and therefore do not contain useful data. Our Data Sensing technology senses data and extracts information from the page that will most likely import directly into a database.
Naturally there is a full copy of all data provided in another window so that a full comparison can be made by the operator of the program.
Web pages are designed by humans for humans. Even if the data is coming from a web server, the formatting of the data is controlled by the designer of the web page. This means that the formatting of data does not always follow logically defined definitions.
To extract data from these sorts of web pages, we have now added 'User Controlled Data Definitions' to our programs.
This feature lets you customize data searches in web pages.
Data Pattern Matching is the next step towards giving users of our programs greater control over extracting data from web pages. Web page designers are using more creative license when they create pages based on databases. It is not uncommon to see data, that essentially should be one record, spread over a number of lines on a web page. If you have data formatted in this way then you have data that can be found by Data Pattern Matching and rebuilt as a delimited database. It is a powerful feature that makes WebToData one of the most useful software programs available.
Extended Pattern Matching further extends the capacity of WebToData to extract data from web pages. Typical uses of Extended Pattern Matching include extracting large chunks of data from web pages.
A company has created a web based survey that clients fill out to express service delivery effectiveness. Each survey returned by a client contains sensitive data and is compiled into a single web page. Web programmers who are given the job of transferring the results to a spreadsheet find that the normal tools available do not interpret the data nor provide results that are anywhere near ideal. After purchasing WebToData the company is now able to extract 99 rows of data per page (this is not the limit just the number processed in this real example) and rebuild each survey response into a record in a database. The company successfully processed 9000 surveys using the multi file processing feature, the programmers are happy and the data is successfully interpreted in a spreadsheet.
A small business man has 7000 web pages that contain information he wants to turn into data to be used in a direct mail campaign. He finds the normal data detection tools do not give the desired results. Using the extended reach of Extended Pattern Matching he successfully creates the database and reaches thousands of potential clients.