Deep Web University
Discover over 200 articles we’ve written about the Deep Web in our Deep Web University.
If you’re hoping to learn more about our harvesting technology, you’re in the right place. In this walk-through of our harvesting process, we hope to answer some of the most common questions about our harvesting technology and take you behind the scenes of our process. The following content is fairly technical, if you want a higher level overview of our process, reference our Data-as-a-Service page.
We’ve developed a patented Deep Web Harvester that allows us to harvest data from web pages using pre-existing harvest types. BrightPlanet’s Deep Web Harvester contains the following harvests. All of BrightPlanet’s harvests are highly scalable and allow for the harvesting and collection of thousands of web pages.
Once the Deep Web Harvester identifies the page to harvest, the curation process begins. The curation process involves structuring the data and converting it from a web page. The process begins with extracting all the text from the page and placing it into a completely unstructured format.
The data is now ready to structure. We structure data using a process called entity extraction. Entity extraction involves identifying key terms within a web page. These typically involve the names of people, companies, and places. Extraction uses a rule-based engine, which means we can also customize entities for each customer to tag the names of your products, and other terms important to you.
Now that the data is harvested and curated, it’s ready to then begin asking the data questions. BrightPlanet customers typically interact with harvested data one of three ways.
Explore the Deep Web and BrightPlanet’s processes with Data Acquisition Engineer Jamie Martin as your guide.
See how other companies are taking advantage of BrightPlanet’s Data-as-a-Service for their business.
Ready to get started? Our Deep Review is a great place to start. You’ll get direct access to our engineering and consulting team in a 6 week funded proof of concept using your actual data.
Schedule a free consultation with a BrightPlanet® Data Acquisition Engineer today.
Discover over 200 articles we’ve written about the Deep Web in our Deep Web University.
See how real businesses are using BrightPlanet’s technology to develop their own insights.
Schedule a free consultation with a BrightPlanet Data Acquisition Engineer today.