Register for your free TechRepublic membership or if you are already a member, sign in using your preferred method below.
We recently updated our Terms and Conditions for TechRepublic Premium. By clicking continue, you agree to these updated terms.
Invalid email/username and password combination supplied.
An email has been sent to you with instructions on how to reset your password.
By registering, you agree to the Terms of Use and acknowledge the data practices outlined in the Privacy Policy.
You will also receive a complimentary subscription to TechRepublic’s News and Special Offers newsletter and the Top Story of the Day newsletter. You may unsubscribe from these newsletters at any time.
All fields are required. Username must be unique. Password must be a minimum of 6 characters and have any 3 of the 4 items: a number (0 through 9), a special character (such as !, $, #, %), an uppercase character (A through Z) or a lowercase (a through z) character (no spaces).
Are companies linking document management into their big data strategies? They should be
Your email has been sent
Many big data implementations are leaving document management systems behind, but DMSes house major stores of unstructured data. Should data analysts think again?
The earliest document management systems (DMSes) appeared in the 1980s. They moved beyond physical file cabinets and PC server storage and appeared on networks where multiple people and departments within a single company could gain access to a trove of documents in electronic form.
Since then, document management systems have been the primary movers and shakers behind companies’ efforts to digitalize. These systems scan, index, store, retrieve and transform documents. They have been instrumental in moving paper-based documents and images out of file cabinets and storage rooms and onto widely distributed networks that everyone uses.
The question is: Are companies linking document management with their big data strategies?
In many cases, companies are lagging.
SEE: Microsoft Power Platform: What you need to know about it (free PDF) (TechRepublic)
The big data repositories that are being built combine systems of record data with incoming Internet of Things and outside source data that is unstructured. Document management systems are used in this process, but there isn’t necessarily a concerted effort in big data strategies to maximize all of the data in a DMS.
On the DMS side, users search data and digitize and organize it — but other big data technologies, such as data cleaning and normalization, artificial intelligence, machine learning and more advanced algorithm development, aren’t yet broadly used.
Of course, there are niche exceptions.
One of these exceptions is the legal discovery process that pores through reams of documents that are often housed in corporate document management systems. The goal of a legal discovery software is to analyze unstructured documents and to use AI and machine learning to determine which documents (out of thousands) are most relevant to a potential upcoming legal case, and which are not.
In this instance, there are no lengthy corporate arguments about whether it’s necessary to import documents into a big data repository from a DMS. The use case stands by itself.
However, in other cases, a compelling reason to mesh a DMS with a big data repository might not be there. For instance, does a genome sequencing experiment really rely on what a DMS would typically include?
SEE: Digital transformation: 3 things your organization can’t afford to overlook (TechRepublic)
The takeaway is not really whether a DMS is needed for a big data repository but simply that it should be considered. The DMS so often becomes an outlier for big data strategy because data scientists and IT data analysts have a tendency to overlook it.
What should companies do to ensure that their DMS systems are included as potential sources for information that flows into a big data repository? Here are four steps.
Are companies linking document management into their big data strategies? They should be
Your email has been sent
Your message has been sent
TechRepublic Premium content helps you solve your toughest IT issues and jump-start your career or next project.
Windows 11 gets an annual update on September 20 plus monthly extra features. In enterprises, IT can choose when to roll those out.
Edge AI offers opportunities for multiple applications. See what organizations are doing to incorporate it today and going forward.
This is a complete guide for Apple’s iPadOS. Find out more about iPadOS 16, supported devices, release dates and key features with our cheat sheet.
Discover data intelligence solutions for big data processing and automation. Read more to explore your options.
Whether you are a Microsoft Excel beginner or an advanced user, you’ll benefit from these step-by-step tutorials.
This document helps make sure that you address data governance practices for an efficient, comprehensive approach to data management. This checklist from TechRepublic Premium includes: an introduction to data governance, a data governance checklist and how to manage a data governance checklist. From this checklist’s introduction: Data governance is the process by which an organization …
Recruiting a Scrum Master with the right combination of technical expertise and experience will require a comprehensive screening process. This hiring kit provides a customizable framework your business can use to find, recruit and ultimately hire the right person for the job. This hiring kit from TechRepublic Premium includes a job description, sample interview questions …
Knowing the terminology associated with Web 3.0 is going to be vital to every IT administrator, developer, network engineer, manager and decision maker in business. This quick glossary will introduce and explain concepts and terms vital to understanding Web 3.0 and the technology that drives and supports it.
While the perfect color palette or the most sublime button shading or myriad of other design features play an important role in any product’s success, user interface design is not enough. Customer engagement and retention requires a strategic plan that attempts to measure, quantify and ultimately create a complete satisfying user experience on both an …