What developers need to know about the rapidly growing technologies of document warehousing and text mining
This unique book shows warehouse developers and managers how to build this new type of warehouse, how to organize free-form text for easy access, and, most importantly, how to exploit text mining techniques to provide timely and accurate information for decision-makers. The author covers the complete process of building and managing a document warehouse, including examples of actual implementations, a review of security issues and tools such as XML and Wide Area Information Servers and their selection criteria, and how text mining techniques are different from data mining techniques.
"Sinopsis" puede pertenecer a otra edición de este libro.
"This book combines a thorough introduction to document warehousing with an in-depth technical tutorial for implementation. Dan Sullivan truly leaves no stone unturned. This book is my de-facto document warehousing resource!"--Jill Dyche, Baseline Consulting Group
Most business information isn't neatly stored in databases. It's more likely found in a swirl of millions of Web pages, e-mails, and free-form text documents. To capture and tame this flood of information for decision making, businesses are now turning to document warehousing and text mining techniques.
This book provides database and data warehouse developers and managers with complete guidance on how to build and manage a document warehouse, how to organize unstructured text for easy storage and retrieval, and how to leverage text mining techniques to provide timely and accurate information for decision-makers.
You'll learn how document warehousing and text mining technologies compare with their numeric data warehousing and data mining counterparts, and discover essential tools for tapping into information resources within your company and across the Internet. With the help of example applications, checklists, and templates, this book teaches you to:
* Design the architecture of a document warehouse
* Find and retrieve text documents from multiple sources
* Load information into the warehouse and transform it to the desired form
* Select the right tools to thematically index, categorize, cluster, and summarize text
* Adapt the appropriate meta data for your document warehouse
* Use text mining for operational management, customer relationship management, and competitive analysis
* Ensure the security and privacy of your document warehouse
The companion Web site at www.wiley.com/compbooks/sullivan contains:
* Scripts for common tasks
* Document templates used in the design stage of document warehouse development
* Links to related sites
DAN SULLIVAN, Director of Data Warehousing at Computer Resource Team Inc., designs business intelligence solutions for a variety of industries. He has 15 years of experience in developing natural language processing and database applications and is widely published on data warehousing and text analysis for business intelligence.
"Sobre este título" puede pertenecer a otra edición de este libro.
Descripción Wiley. PAPERBACK. Estado de conservación: New. 0471399590 New Condition. Nº de ref. de la librería NEW7.1113635
Descripción Wiley, 2001. Paperback. Estado de conservación: New. Never used!. Nº de ref. de la librería P110471399590
Descripción John Wiley & Sons. Estado de conservación: New. pp. 608. Nº de ref. de la librería 7598488
Descripción Wiley, 2001. Paperback. Estado de conservación: New. 1. Nº de ref. de la librería DADAX0471399590