Monday, November 9, 2015

Tensor

Let’s look inside a Learning Machine - Google Tensor
https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi-5K0GDYQsrw_oRhC_2hQ0lOp5xC0n1GRxtnpViaFNmriieL3FWkHQRf9QJS_5W7ZMPilxA-MxibJO4A9dygaRSqj0x15klMEstjZ7JDbVtg282JnG8IRd4wTTBEfS828paM2UEQi-Vk4/s1600/cifar10_2.gif
http://www.tensorflow.org/
http://googleresearch.blogspot.com/2015/11/tensorflow-googles-latest-machine_9.html
http://googleresearch.blogspot.com/2015/11/computer-respond-to-this-email.html
https://www.youtube.com/watch?v=46Jzu-xWIBk
https://www.youtube.com/watch?v=gY9DewL6Dqk

Friday, November 6, 2015

Machine learning for better medical income on TAP

 Michael Draugelis, is chief data scientist, The University of Pennsylvania, Penn Medicine. He came to this gig in a way roundabout. You see, his wife went into shock while giving birth to their child, Chubsy Ubsy. Mother and child are doing well, but the experience made Draugelis wonder. That's because of his background in US Missile Defense Agency, where they did a lot of work about forecasting clues to impending events. He appeared at Strata + East to discuss all this.

When he got to Penn Med he set out to focus on Sepsis, an unfortunately leading cause of death for people who go to the hospital to fix something else. His data scientist efforts revolve around something called Penn Signals. "A nerd wonderland," says he. The point is to use evidence-based computing to see who is really really at risk. Right now!

Now, of course, in reality,  he is doing what all the machine learning people are doing:

We have some early proof of concepts that have created new signals..that we are feeding back into our algorithms.

What is next?

I hope to move on from prediction to auto reasoning.

That is where his work with Intel comes in.

The company, which would like to fight disease just as much as it would like to sell chips that go into servers on Web farms, has concluded that big data development, where egg headed data scientists create models in Python etc., that must be thrown over the wall, and rejiggered in Java by 'real' developers, and then sent back to fix, and then back over the wall again etc., was just going to take too much time. Why not more of a cloud development paradigm? Based on open source? Tested by Intel and the like of Penn? It's called TAP, for Trusted Analytics Platform.

Says our man Draugelis:

My data scientists need an environment that they can build quickly, select their analytic tools at scale, and a platform that can support it.  We have been excited to work with Intel to explore this new open source project called TAP. 

As a colleague said, or more precisely asked: " What was the last big software announcement Intel made? And what came of it? And yes the answer is The Intel Hadoop Distribution and what came of it was a ceding of the work to Cloudera.So to coin a phrase: Time will tell. - Jack Vaughan

For more 

Go to github to find out what it would be like to be a cloud developer these days.

Listen to a podcast where we talk with Intel's Vin Sharma about TAP

You see that embedded video above, this here is the linke to it, like actually. 


Saturday, October 24, 2015

AmaMart meets Walzon


Here in the Digital Age, business models continue to be buffeted, often in surprising ways.  The big, lever you push to sell your product – it seems – is data, analytics, and cloud (you figure which is rod, pivot, force). It plays out most vividly in the dialectic of Walmart and Amazon.
==
Walmart faces challenges. It gets harder to grow an operation when its revenues mount to almost $475 billion a year. Some folks would say Amazon.com, with Web services and large-scale cloud clusters, has created an e-commerce killer app aimed straight at the company atop the Fortune 500. It will be interesting to see if Walmart's coupling of Hadoop and data democracy will help it deflect such challenges. From Big data applications to driveWalmart reboot on SearchDataManagement.com

==div

An article by Jim Stewart in NYT talks about the models shaking as Walmart flattens and Amazon edges upward. Walmart has been working for 15 years to curb the e-commerce incursions of Amazon but effect is narrow.  Prof Greenwald (see below) says as an investor it rather see Walmart  team with an existing company doing analytics and cloud, than to continue to try to build from within. (It actually has made purchases, but not too notable.)


--div
When Walmart announced last week that it was significantly increasing its
investment in e­commerce, it tacitly acknowledged that it had fallen far behind
Amazon in the race for online customers.
====div
“The shift in retail to the Internet is a huge change, and it’s not just
affecting Walmart,” said Simeon Gutman, a retailing analyst for Morgan
Stanley.

==div
Every retail company is trying to manage the transition. It’s not well
defined or understood and there’s no road map. Walmart is just the biggest.
It’s a behemoth that was built on superstores with volume and distributionefficiencies. That whole model is being unwound.”===div
Mr. Gutman said: “Walmart pricing is decent, depending on the basket of
goods, but they used to be dominant on price. They’ve lost that. Pricing is verycomplex and no one is really executing this all that well. On the Internet, priceis transparent, and your price advantage competes away the minute you gainit. Consumers are demanding and getting the lowest possible prices.”====div
as Prof. Bruce C. Greenwald of the Columbia Business School
noted in his book “Competition Demystified.”
===div
Walmart’s superefficient distribution system — a function of its enormousvolume and geographic reach — was long the secret to Walmart’s immenseprofitability,--div
“Every retail company is trying to manage the transition. It’s not well
defined or understood and there’s no road map. Walmart is just the biggest.
It’s a behemoth that was built on superstores with volume and distributionefficiencies. That whole model is being unwound.”--div
Simeon Gutman,--div
“In theory, they should be able to use their immense volume anddistribution network to compete with Amazon,” he said. “But they’re not atechnology company, and I don’t know what makes them think they are.” Asan investor, he said, he would rather see Walmart team up with an existingtechnology company that already has the analytics and cloud computingcapacity, rather than try to build its own. “That doesn’t inspire confidence,” hesaid.
“Distribution is very hard to get right in retail,” he said. “Walmart’s worldis the giant superstore, where you have big pallets of merchandise moving totrucks moving it into the stores. You start talking about delivering individualitems to consumers, and they’re out of their comfort zone.”


To me thinking the pricing perhaps tells the deeper tale. Walmart sells John Grisham's Rogue Lawyer for $24.61; Amazon sells it for $17.38. UnderArmor at Walmart is $33.78; at Amazon it is $29.99. Wall Street jumped favorably (6.23%) this week when Amazon made 1/3rd of 1 percent profit for the quarter. It has lost money on everything (almost) it has ever sold – and with that scheme played out, it has turned to monetizing its cloud infrastructure. The cloud, said the Times, is raining money.

The big box was always a shell game where 'the tremendous volume kept things going'. Staples today is an example. Scorched earth policy toward mom and pop stationary stores – as the populace more out of of the town. Now what? After pressing manufacturing to move to China, no middle class to buy a life time supply of you name it.  - Jack Vaughan


http://www.nytimes.com/2015/10/23/technology/amazon-q3-earnings.html

http://searchdatamanagement.techtarget.com/opinion/Big-data-applications-to-drive-Walmart-reboot

A note on the comp: On one level this and the previous piece (on Machine Learning) kick around the notion of the Triple Store Tuple factoid graph. Which I hope to continue to learn about. When I read a story in the NYT, and the factoids* are popping, it looks like this (below) Next stop, however, is Las Vegas, and IBM Insight 2015.



*What did Ed Sanders call them, glyph clusters?






Sunday, October 11, 2015

Machine learning and science

Machine learning 

is finding its way into medical and scientific research in a variety of ways.

Two such paths are covered in a recent 

Talking Machines  Webcast.


Quaid Morris of the University of Toronto

speaks about using machine learning

to find better ways to treat cancers.


Also, Patrick Atwater discusses machine learning as a means to address issues of

California drought.


Both may have participated in the NIPS conference.

Wednesday, September 23, 2015

AirBnB Where are you?

Fig. 1 - Professionally bound neighborhoods
Web e-commerce has long been seen as a threat to traditional middlemen, but the threat it now poses businesses like cabs and hotels seem to raise the odds.  Uber takes the phone out of Louie the dispatchers hands, and AirBnB puts Web automation into the door-to-door process once known as 'can I crash at your pad.' Representing a new style of broker, the upstarts pose yet another threat level to those whose hegemony would be disrupted.

The role of the broker can rub at least two ways, maybe that is why it has been seen as a favorable position. History has seen brokers tend to favor one side over another in a transaction or two. Uber, for now, sets itself as arbiter of the cost of the ride – tho they might point to the black box if you ask them who decides. Uber could change its modus – it's still early.

How things could change may be seen in an AirBnB algorithm that has received some recent coverage. To hear tell, AirBnB saw what it could benefit if its customers on the providing housing side of the equation could come to a better estimate of the probable value of a night in their abode. While there are precedents in price advice systems from eBay and elsewhere, AirBnB claims somewhat convincingly that there approach is unique.  Still, they like some help with the estimator, so they have made it open source.

As depicted in an August 2015 IEEE Spectrum story, AirBnB set its secret sauce to percolating when the data science team began to calculating. Author Dan Hills writes that it converted the questions people ask when looking for a place to stay into machine learning algorithms. eBays' problem is different. With eBay, location is not really a factor. The timing is now, not 3 nights in October.  There's not a whole lot of difference between one good copy of Big Brother & the Holding Co.'s first Columbia LP and another.

AirBnB looked to create a tool that was dynamic,  that considered the unusual characteristics of a listing, and left room for human intuition when necessary.  I will classify that last bit as Surprise 1. Surprise 2 is that they use focus groups (no blind machine learning patriots, they) – and Surprise 3 is that they hired a professional  cartographer to hand draw accurate neighborhood boundaries for important world cities for travellers. [See Fig. 1]

I don’t know if this is a surprise.. but it is a bit amusing that author Hills previous gig was with a company called "Crash Padder" (itself bought by AirBnB). Amusing to this writer as he experience the crash pad experience first hand, coming late in the cycle when a sufficient number of people had been burned by thieving guests to put a serious lid on peace, love and  why not?

In Boston, and some other towns, Uber has already come to unwanted prominence for its capacity to bring on the wrong help. They and AirBnB will probably keep a team of data wonks busy for some time creating filters that bar the occasional felon flotsam from gumming up their march to ever higher evaluations. - Jack Vaughan, Boston

Related
http://spectrum.ieee.org/computing/software/the-secret-of-airbnbs-pricing-algorithm


 



Sunday, September 13, 2015

Doting on Data on Sunday Sept 13

NoSQL meets SLAs

As NoSQL technology continues its march into the enterprise, the story has a different tenor. Now we are talking models, and SLAs.

As Web-scale NoSQL technology looks to find a deeper footing in the enterprise, there may be as many stumbles as steps forward. That was an underlying theme at the NoSQL Now 2015 conference, where issues with service-level agreements (SLAs), data analytics challenges and a lack of skills were often part of the discussion.
...
Looking forward, what we may see is a branching, where raw, original-style NoSQL serves the needs of pedal-to-the-metal developers, and something else evolves to meet the stricter needs of enterprise shops.


Yes, some folks, like the bandits in The Treasure of the Sierra Madre, 'don’t need no stinkin' badges,' but, to the extent that Big Data is all about mining that Web trove, we will find people trying to bring SQL to NoSQL just as we do in Hadoop. - Jack Vaughan

Read the story on SearchDataManagement.com. 

Thursday, September 10, 2015

Hand-picked data ferment

The debate of clashing memes - The problem of hand-picked data is not new. Truth battles its way to surface or oblivion everyday. People smarter than their predecessors, or not, make decisions. Is something new a foot? No, you find it in a quote attributed to Mark Twains, and others. “People commonly use statistics like a drunk uses a lamp post; for support rather than illumination.” George Johnson's Raw Data column suggests this phenomenon is exacerbated by the Internet. Perhaps, perhaps. - Jack Vaughan

Monday, August 31, 2015

Check out out data preparation podcast