Latest from Pandera Systems

Leaders in Enterprise Decision Automation®.

Data Science

Fake News and the Responsibility of Data Scientists

95% of statistics are made up.

Discussions about fact versus truth come up quite a bit these days, especially with the proliferation of “fake news” and the news media’s coverage of certain facts (and non-facts, for that matter). We hear chatter all over the place where folks bring up hot-button issues and claim “90% of people agree with my fact!” and “I have proof that X happened; it’s a fact!” Sites like snopes.com, politifact.com, and factcheck.org exist to remove political bias and interpretation from these “facts,” but why would we need such things? Is it because reporters actively make up information and deliberately lie to viewers? Or could it be that they simply tweak hard facts just enough to fit them into their preferred narrative?

Snowflake for Data Science

Anyone working on a cloud-friendly data science, data engineering, or data warehousing team has surely heard the name Snowflake come up over the past couple of years.

For those who haven’t, Snowflake is a relatively new database solution that is highly innovative in some ways and yet familiar in the ways that matter.

The product is essentially a SaaS database built with the cloud-native features that we 21st-century data enthusiasts (fine… nerds) have come to expect from off-the-shelf products. Some of these features include:

  • the ability to quickly and automatically scale computing power — responding to highly variable workloads in a way that enables high velocity data to flow in, unimpeded by ingestion bottlenecks
  • the separation of storage and compute, both technically and on billing statements — making cold data retention much more cost-effective
  • the ability to segregate and securely share chunks of data — reducing the amount of maintenance and management that data ops teams need to invest in shared data assets
  • automatic query and data optimization — this one speaks for itself!
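To make a couple of these features a little more concrete, here is a minimal sketch in Python that only builds the Snowflake SQL a team might run for an auto-scaling warehouse and a secure share. It does not connect to anything. The helper functions and every object name (`DS_WH`, `SALES_SHARE`, `ANALYTICS`, `CONSUMER_ACCT`, and so on) are hypothetical and exist purely for illustration. Multi-cluster scaling also depends on the Snowflake edition in use, so treat the values as assumptions rather than recommendations.

```python
from typing import List


def warehouse_ddl(name: str, size: str = "XSMALL", min_clusters: int = 1,
                  max_clusters: int = 3, auto_suspend_s: int = 60) -> str:
    """Build a CREATE WAREHOUSE statement (hypothetical helper).

    The warehouse scales between min_clusters and max_clusters and
    suspends itself after auto_suspend_s seconds of inactivity.
    """
    if not 1 <= min_clusters <= max_clusters:
        raise ValueError("need 1 <= min_clusters <= max_clusters")
    return (
        f"CREATE WAREHOUSE IF NOT EXISTS {name} "
        f"WAREHOUSE_SIZE = '{size}' "
        f"MIN_CLUSTER_COUNT = {min_clusters} "
        f"MAX_CLUSTER_COUNT = {max_clusters} "
        f"AUTO_SUSPEND = {auto_suspend_s} "
        f"AUTO_RESUME = TRUE"
    )


def share_ddl(share: str, database: str, schema: str, table: str,
              consumer_account: str) -> List[str]:
    """Build the statements that expose one table through a share
    (hypothetical helper; all identifiers are supplied by the caller)."""
    return [
        f"CREATE SHARE {share}",
        f"GRANT USAGE ON DATABASE {database} TO SHARE {share}",
        f"GRANT USAGE ON SCHEMA {database}.{schema} TO SHARE {share}",
        f"GRANT SELECT ON TABLE {database}.{schema}.{table} TO SHARE {share}",
        f"ALTER SHARE {share} ADD ACCOUNTS = {consumer_account}",
    ]


if __name__ == "__main__":
    # Print the generated SQL so it can be reviewed before anyone runs it.
    print(warehouse_ddl("DS_WH"))
    for stmt in share_ddl("SALES_SHARE", "ANALYTICS", "PUBLIC",
                          "ORDERS", "CONSUMER_ACCT"):
        print(stmt)
```

Generating the statements as plain strings keeps the sketch runnable without credentials. In practice the same SQL would be run from a worksheet or through a client library.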

Pandera to Bring the Southern Data Science Conference to Orlando

Orlando, FL – Pandera Systems, a global provider of information delivery solutions and an analytics innovation consulting company, has announced its partnership with the Southern Data Science Conference. This partnership will bring a robust roster of speakers to Orlando, Florida on October 17th and 18th for two days of cutting-edge discussions, workshops, and networking at the Renaissance Orlando at SeaWorld. This innovative, single-track conference has drawn attention from around the world and will feature speakers from top data science companies such as Google, Facebook, and Microsoft, and from top research institutions such as the Allen Institute for Artificial Intelligence and UCF. These speakers have been selected for their outstanding contributions to the data science community, and we are ecstatic to bring them all under one roof. Beyond that, we are extremely fortunate to have such great community support from the City of Orlando, the University of Central Florida, the South Big Data Hub, and Orlando Tech and Beer.
