Sneha Girap (Editor)

Gregory Piatetsky Shapiro

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Name
  
Gregory Piatetsky-Shapiro


Gregory Piatetsky-Shapiro Interview with Data Scientist Gregory PiatetskyShapiro PhD


Gregory piatetsky shapiro kdnuggets president kdnuggets a top big data data science influencer


Gregory I. Piatetsky-Shapiro (born 7 April 1958) is a data scientist and is the co-founder of the KDD conference and the ACM SIGKDD association for Knowledge Discovery and Data Mining. He is also the founder, president and current editor of KDnuggets.com a website for learning and discussion on the areas of Business Analytics, Data Mining, and Data Science.

Contents

Gregory Piatetsky-Shapiro httpsd32gmh0vc2w78ncloudfrontnetwpcontentu

Early life

Piatetsky was born to a Jewish-Russian family in Moscow, Russia. His father, Ilya Piatetski-Shapiro, was a well-known mathematician. He was admitted in 1970 to the Physics-Mathematics School, one of the most prestigious schools in Moscow.

In March 1974, Piatetsky emigrated to Israel with his mother, Inna. There, he studied mathematics at Tel-Aviv University, and computer science for one semester at Technion. He then received an MS (1979) and Ph.D. (1984) from NYU Courant Institute. His first paper, published in SIGMOD in 1984, proved that secondary index selection is NP-complete by reducing it to set cover problem. However, in his dissertation he proved that the greedy method for set cover has a lower bound of 1 - 1/e ~ 63% of the optimal.

Career

After getting his Ph.D. in 1985, he joined GTE Laboratories in 1985, where he worked on intelligent interfaces to databases. In 1989, he proposed a new project at GTE called "Knowledge Discovery in Databases." The project created a number of advanced prototypes, including KEFIR (Key Findings Reporter), a system for analysis and summarization of key changes in large databases, which was a forerunner of systems like Google Analytics Intelligence. A KEFIR prototype was applied to GTE health care data and received GTE's highest technical award.

In 1997, he left GTE to join Knowledge Stream Partners (KSP), where he was Director and later Vice President and Chief Scientist. In April 2000, KSP was acquired by Xchange, Inc., where Piatetsky served as VP and Chief Scientist.

Piatetsky left Xchange in May 2001 to become a self-employed consultant and is now focused on running KDnuggets [6].

KDD and SIGKDD

In 1989, Piatetsky organized the first workshop on Knowledge Discovery in Data (KDD-89), held at IJCAI-1989 in Detroit, MI. This workshop had over 60 attendees, including researchers Ross Quinlan and Jaime Carbonell.

Piatetsky organized the next two KDD workshops in 1991 (Anaheim, CA) and 1993 (Washington, DC). With Usama Fayyad and Ramasamy (Sam) Uthurusamy, he grew the workshops into an annual international conference on Data Mining and was the General Chair of the KDD-98 conference.[7]. He served as the chair of the KDD Steering committee until 1998, when the SIGKDD group was formed as part of ACM to run the annual KDD conference and help promote research in Knowledge Discovery and Data Mining. He served as Director of SIGKDD for 2001-2005 and as SIGKDD Chair for 2005-2009.

In 1997, Piatetsky and Ismail Parsa initiated the KDD Cup competition, which was the first open data mining contest in the world.[8]

The annual ACM SIGKDD conference is the leading research conference on Knowledge Discovery and Data Mining, according to Microsoft Academic search and Google Scholar. The 21st ACM SIGKDD conference was held in Sydney, Australia August 2015.

KDnuggets

In 1993, Piatetsky started Knowledge Discovery Nuggets (KDnuggets) as a newsletter to connect researchers who attended the KDD-93 workshop. With the emergence of the Internet and Mosaic, he and Chris Matheus eventually created a website: Knowledge Discovery Mine, hosted at GTE Labs. The newsletter served as an unofficial publication of KDD workshops. When Piatetsky left GTE Labs in 1997, he created the KDnuggets website, which stands for Knowledge Discovery Nuggets, with the mission of covering the field with short, concise "nuggets". The resource started as a directory of main areas of data mining and data science, including Software, Jobs, Academic positions, CFP (calls for papers), Companies, Courses, Datasets, Education, Meetings, Publications, and Webcasts.

Today, KDnuggets' main focus is to cover the news in the field of Business Analytics, Data Mining, and Data Science, including interviews with many key leaders of the field. It also offers a free data mining course for advanced undergraduates or first-year graduate students.

@KDnuggets Twitter was

  1. Voted the Best Big Data Tweeter by Big Data Republic (2013)
  2. In Top 10 Most Influential Brands on Big Data, Onalytica, May 2017.
  3. No. 1 in Agilience Top Authorities in Machine Learning, Nov 2016.
  4. No. 1 in Agilience Top Authorities for Data Mining, No. 2 for Data Science, Nov 2016.
  5. No. 3 in AI Intelligence & Machine Learning: Top 100 Influencers and Brands, Onalytica, Mar 2016.
  6. No. 4 in Big Data 2016: Top 100 Influencers, Onalytica, Feb 2016.
  7. In InformationWeek Twitter Top 10 Data Science, Analytics, And BI Feeds, Jan 2016

In February 2015, Piatetsky and Data ScienceTech Institute announced a partnership and he became an Honorary Member of its Scientific Advisory Board.

Research and Publications

In 1991, Piatetsky and William (Bud) Frawley edited the first book on Knowledge Discovery in Databases, and in 1996, Gregory Piatetsky, Usama Fayyad, Padhraic Smyth, and R. Uthurusamy edited a follow-up Advances in Knowledge Discovery and Data Mining

Piatetsky also helped launch and was a co-editor of Data Mining and Knowledge Discovery journal. He has 9 edited books and collections and over 60 technical papers, articles and book chapters, mostly focusing on data mining and knowledge discovery.

Awards and recognition

  • 1984, NYU Award for Best Dissertation in Computer Sciences, Ph.D. Thesis: "A Self-Organizing Database System - A Different Approach to Query Optimization".
  • 1985, NYU Award for Best Dissertation in all Natural Sciences (1985).
  • 1995, Leslie H. Warner award—GTE's highest for technical achievement—for the KEFIR system.
  • 2000, First SIGKDD Service Award, for contributions to Data Mining and Knowledge Discovery.
  • 2007 IEEE ICDM Outstanding Service Award, for major contributions to data mining field, 2007.
  • References

    Gregory Piatetsky-Shapiro Wikipedia