What is the solution to this unstructured data problem, and what kind of analysis can be done with current tools?

+1 vote
Unstructured data is everywhere. What is out there that can pull all of this information together?
asked Aug 16, 2013 by johnmay (440 points)
edited Mar 3, 2014 by trustnetadmin

2 Answers

0 votes

Unstructured databases contain data with varying semantic and syntactic implications. The information hidden in them is weakly formatted, often as a memo, HTML, XML, plain text, etc. These databases can be difficult to index because each data element is nonspecific, has equal priority with respect to its information properties, and is likely to be accessed randomly. The challenge in managing these databases centers on containing the incremental cost of adding new customers and new sources (feeds) while keeping access times to a minimum, especially during a modeling exercise.

We use this data to build an influence (social) network that quantitatively describes the many factors that, individually or in combination, create a form (vector) of influence. We then traverse this network using algorithms that maximize various objective functions, with weighted coefficients representing the type of question, the problem domain, and the context of the problem being solved. Specifically, our influence network defines the relationships and interactions within a group of individuals and determines the spread of information among its members, a methodology that is key to understanding the adoption of a new drug among doctors.

This model allows us to understand the extent to which adoption can take place and the market dynamics that determine success. One of the most important dynamics is how doctors are affected by the decisions of their friends and colleagues, or the extent to which “word-of-mouth” will take hold. It is a basis for “viral marketing”, which is the key to the success of new drugs, including their sudden and widespread adoption.
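
To make the word-of-mouth idea concrete, here is a minimal sketch of spread over a weighted influence network. It uses a simple linear-threshold rule (a doctor adopts once the combined influence of colleagues who have already adopted reaches a threshold), which is a stand-in chosen for illustration and not necessarily the model described above. The function name `spread_adoption`, the doctor names, the weights, and the 0.5 threshold are all hypothetical.

```python
from collections import defaultdict


def spread_adoption(influence, seeds, threshold=0.5):
    """Simulate linear-threshold spread over a weighted, directed network.

    influence: dict mapping (source, target) -> influence weight (assumed 0..1)
    seeds:     doctors who have adopted at the start
    Returns a list of sets: the doctors who newly adopt in each round.
    """
    # Index the edges by target so we can sum the influence arriving at each doctor.
    incoming = defaultdict(dict)
    for (src, dst), weight in influence.items():
        incoming[dst][src] = weight

    adopted = set(seeds)
    rounds = [set(seeds)]
    while True:
        new = set()
        for doctor, sources in incoming.items():
            if doctor in adopted:
                continue
            # Only colleagues who have already adopted exert pressure.
            pressure = sum(w for src, w in sources.items() if src in adopted)
            if pressure >= threshold:
                new.add(doctor)
        if not new:
            break
        adopted |= new
        rounds.append(new)
    return rounds


# Hypothetical network: who influences whom, and how strongly.
influence = {
    ("ann", "bob"): 0.6,
    ("ann", "cara"): 0.3,
    ("bob", "cara"): 0.3,
    ("cara", "dev"): 0.5,
    ("dev", "eli"): 0.2,
}
rounds = spread_adoption(influence, seeds={"ann"})
for number, doctors in enumerate(rounds):
    print(number, sorted(doctors))
```

In this toy network, cara only adopts after both ann and bob have, which is the "individually or in combination" effect in miniature, and eli never adopts because the single weak tie from dev stays below the threshold. In practice the weights would be estimated from the underlying data rather than typed in by hand.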

answered Aug 16, 2013 by johnmay (440 points)

+1 vote
Unstructured data requires software that can decode, understand, and relate the free-form language of documents, papers, emails, postings, etc. to more formal topics and themes. SemantiStar has produced a semantic engine capable of finding, relating, and connecting unstructured and structured data and presenting it in a knowledge exchange. We call this Eridanus, a collaborative semantic platform for exchanging and sharing knowledge.

The key to using semantics in this type of application is the ability to interrelate terms, phrases, comments, and paragraphs in free-form documents and postings. These then form a graph that can be navigated to determine the significance of a document, the relevance of a posting, or the sentiment of a user.
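
As a rough illustration of that graph idea, the sketch below links free-form documents that share terms and uses the amount of overlap as a crude significance score. It is a plain term-overlap toy and makes no claim about how the Eridanus engine works internally. The helper names (`terms`, `build_document_graph`, `significance`), the stopword list, and the sample documents are all assumptions made for the example.

```python
import re
from collections import defaultdict
from itertools import combinations

# A tiny, hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}


def terms(text):
    """Return the set of lowercase words in text, minus stopwords."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def build_document_graph(docs):
    """Connect every pair of documents that share at least one term.

    docs: dict mapping document name -> free-form text
    Returns graph[a][b] = set of terms shared by documents a and b.
    """
    doc_terms = {name: terms(text) for name, text in docs.items()}
    graph = defaultdict(dict)
    for a, b in combinations(sorted(doc_terms), 2):
        shared = doc_terms[a] & doc_terms[b]
        if shared:
            graph[a][b] = shared
            graph[b][a] = shared
    return graph


def significance(graph, name):
    """Crude score: total number of terms shared with neighboring documents."""
    return sum(len(shared) for shared in graph.get(name, {}).values())


docs = {
    "memo": "Trial results for the new drug",
    "email": "Doctors discuss the drug trial",
    "post": "Weather forecast for the weekend",
}
graph = build_document_graph(docs)
print(sorted(graph["memo"]["email"]))
print(significance(graph, "memo"), significance(graph, "post"))
```

Here the memo and the email end up connected through the terms they share, while the unrelated post stays isolated. A semantic engine would go further than literal word matches, for example by relating synonyms and phrases, but the navigable-graph structure is the same.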

Without a semantic backbone, knowledge exchanges are good and useful, but they will always lack the ability to interconnect and organize free-form information, thereby losing a lot of opportunity.
answered Nov 21, 2013 by anonymous