summaryrefslogtreecommitdiff
path: root/powered-by.md
diff options
context:
space:
mode:
Diffstat (limited to 'powered-by.md')
-rw-r--r--powered-by.md239
1 files changed, 239 insertions, 0 deletions
diff --git a/powered-by.md b/powered-by.md
new file mode 100644
index 000000000..5ecfafb4c
--- /dev/null
+++ b/powered-by.md
@@ -0,0 +1,239 @@
+---
+layout: global
+title: Powered By Spark
+type: "page singular"
+navigation:
+ weight: 5
+ show: true
+---
+
+<h2>Project and Product names using "Spark"</h2>
+
+Organizations creating products and projects for use with Apache Spark, along with associated
+marketing materials, should take care to respect the trademark in "Apache Spark" and its logo.
+Please refer to <a href="http://www.apache.org/foundation/marks/">ASF Trademarks Guidance</a> and
+associated <a href="http://www.apache.org/foundation/marks/faq/">FAQ</a>
+for comprehensive and authoritative guidance on proper usage of ASF trademarks.
+
+Names that do not include "Spark" at all have no potential trademark issue with the Spark project.
+This is recommended.
+
+Names like "Spark BigCoProduct" are not OK, as are names including "Spark" in general.
+The above links, however, describe some exceptions, like for names such as "BigCoProduct,
+powered by Apache Spark" or "BigCoProduct for Apache Spark".
+
+It is common practice to create software identifiers (Maven coordinates, module names, etc.)
+like "spark-foo". These are permitted. Nominative use of trademarks in descriptions is also
+always allowed, as in "BigCoProduct is a widget for Apache Spark".
+
+<h2>Companies and Organizations</h2>
+
+To add yourself to the list, please email `dev@spark.apache.org` with your organization name, URL,
+a list of which Spark components you are using, and a short description of your use case.
+
+- <a href="http://amplab.cs.berkeley.edu">UC Berkeley AMPLab</a> - Big data research lab that
+initially launched Spark
+ - We're building a variety of open source projects on Spark
+ - We have both graduate students and a team of professional software engineers working on the stack
+- <a href="http://4quant.com">4Quant</a>
+- <a href="http://www.actnowib.com">Act Now</a>
+ - Spark powers NOW APPS, a big data, real-time, predictive analytics platform. We use Spark SQL,
+ MLlib and GraphX components for both batch ETL and analytics applied to telecommunication data,
+ providing faster and more meaningful insights and actionable data to the operators.
+- <a href="http://adatao.com">Adatao, Inc.</a> - Data Intelligence for All
+ - Visual, Real-Time, Predictive Analytics on Spark+Hadoop, with built-in support for R, Python,
+ SQL, and Natural Language.
+ - Team of ex-Googlers and Yahoos with large-scale infrastructure experience
+ (including both flavors of MapReduce at Google and Yahoo) and PhD's in ML/Data Mining
+ - Determined that Spark, among the many alternatives, answered the right problem statements with
+ the right design
+- <a href="http://www.agilelab.it">Agile Lab</a>
+ - enhancing big data. 360 customer view, log analysis, BI
+- <a href="http://www.taobao.com/">Alibaba Taobao</a>
+ - We built one of the world's first Spark on YARN production clusters.
+ - See our blog posts (in Chinese) about Spark at Taobao:
+ <a href="http://rdc.taobao.org/?tag=spark">http://rdc.taobao.org/?tag=spark</a>
+- <a href="http://alpinenow.com/">Alpine Data Labs</a>
+- <a href="http://amazon.com">Amazon</a>
+- <a href="http://www.amrita.edu/cyber/">Amrita Center for Cyber Security Systems and Networks</a>
+- <a href="http://www.art.com/">Art.com</a>
+ - Trending analytics and personalization
+- <a href="http://www.asiainfo.com">AsiaInfo</a>
+ - We are using Spark Core, Streaming, MLlib and Graphx. We leverage Spark and Hadoop ecosystem
+ to build cost effective data center solution for our customer in telco industry as well as
+ other industrial sectors.
+- <a href="http://www.atigeo.com">Atigeo</a> – integrated Spark in xPatterns, our big data
+analytics platform, as a replacement for Hadoop MR
+- <a href="https://atp.io">atp</a>
+ - Predictive models and learning algorithms to improve the relevance of programmatic marketing.
+ - Components used: Spark SQL, MLLib.
+- <a href="http://www.autodesk.com">Autodesk</a>
+- <a href="http://www.baidu.com">Baidu</a>
+- <a href="http://www.bakdata.com/">Bakdata</a> – using Spark (and Shark) to perform interactive
+exploration of large datasets
+- <a href="http://http//www.bigindustries.be/">Big Industries</a> - using Spark Streaming: The
+Big Content Platform is a business-to-business content asset management service providing a
+searchable, aggregated source of live news feeds, public domain media and archives of content.
+- <a href="http://www.bizo.com">Bizo</a>
+ - Check out our talk on <a href="http://www.meetup.com/spark-users/events/139804022/">Spark at Bizo</a>
+ at Spark user meetup
+- <a href="http://www.celtra.com">Celtra</a>
+- <a href="http://www.clearstorydata.com">ClearStory Data</a> – ClearStory's platform and
+integrated Data Intelligence application leverages Spark to speed analysis across internal
+and external data sources, driving holistic and actionable insights.
+- <a href="https://www.concur.com">Concur</a>
+ - Spark SQL, MLlib
+ - Using Spark for travel and expenses analytics and personalization<
+- <a href="http://www.contentsquare.com">Content Square</a>
+ - We use Spark to regularly read raw data, convert them into Parquet, and process them to
+ create advanced analytics dashboards: aggregation, sampling, statistics computations,
+ anomaly detection, machine learning.
+- <a href="http://www.conviva.com">Conviva</a> – Experience Live
+ - See our talk at <a href="http://ampcamp.berkeley.edu/3/">AmpCamp</a> on how we are
+ <a href="http://www.youtube.com/watch?feature=player_detailpage&v=YaayAatdRNs">using Spark to
+ provide real time video optimization</a>
+- <a href="https://www.creditkarma.com/">Credit Karma</a>
+ - We create personalized experiences using Spark.
+- <a href="http://databricks.com">Databricks</a>
+ - Formed by the creators of Apache Spark and Shark, Databricks is working to greatly expand these
+ open source projects and transform big data analysis in the process. We're deeply committed to
+ keeping all work on these systems open source.
+ - We provided a hosted service to run Spark,
+ <a href="http://www.databricks.com/cloud">Databricks Cloud</a>, and partner to
+ <a href="http://databricks.com/support/">support Apache Spark</a> with other Hadoop and big
+ data companies.
+- <a href="http://dianping.com">Dianping.com</a>
+- <a href="http://www.digby.com">Digby</a>
+- <a href="http://www.drawbrid.ge/">Drawbridge</a>
+- <a href="http://www.ebay.com/">eBay Inc.</a>
+ - Using Spark core for log transaction aggregation and analytics
+- <a href="http://labs.elsevier.com">Elsevier Labs</a>
+ - Use Case: Building Machine Reading Pipeline, Knowledge Graphs, Content as a Service, Content
+ and Event Analytics, Content/Event based Predictive Models and Big Data Processing.
+ - We use Scala and Python over Databricks Notebooks for most of our work.
+- <a href="http://www.eurecom.fr/en">EURECOM</a>
+- <a href="http://www.exabeam.com">Exabeam</a>
+- <a href="http://www.faimdata.com/">Faimdata</a>
+ - Build eCommerce and data intelligence solutions to the retail industry on top of
+ Spark/Shark/Spark Streaming
+- <a href="http://falkonry.com">Falkonry</a>
+- <a href="http://www.flytxt.com">Flytxt</a>
+ - Big Data analytics for subscriber profiling and personalization in telecommunications domain.
+ We are using Spark Core and MLlib.
+- <a href="http://www.jeremyfreeman.net">Freeman Lab at HHMI</a>
+ - We are using Spark for analyzing and visualizing patterns in large-scale recordings of brain
+ activity in real time
+- <a href="http://www.fundacionctic.org">Fundacion CTIC</a>
+- <a href="http://graphflow.com">GraphFlow, Inc.</a>
+- <a href="http://www.groupon.com/app/subscriptions/new_zip?division_p=san-francisco">Groupon</a>
+- <a href="http://www.guavus.com/">Guavus</a>
+ - Stream processing of network machine data
+- <a href="http://www.hitachi-solutions.com/">Hitachi Solutions</a>
+- <a href="http://hivedata.com/">The Hive</a>
+- <a href="http://www.research.ibm.com/labs/almaden/index.shtml">IBM Almaden</a>
+- <a href="http://www.infoobjects.com">InfoObjects</a>
+ - Award winning Big Data consulting company with focus on Spark and Hadoop
+- <a href="http://en.inspur.com">Inspur</a>
+- <a href="http://www.sehir.edu.tr/en/">Istanbul Sehir University</a>
+- <a href="http://www.kenshoo.com/">Kenshoo</a>
+ - Digital marketing solutions and predictive media optimization
+- <a href="http://www.kelkoo.co.uk">Kelkoo</a>
+ - Using Spark Core, SQL, and Streaming. Product recommendations, BI and analytics,
+ real-time malicious activity filtering, and data mining.
+- <a href="http://www.knoldus.com">Knoldus Software LLC</a>
+- <a href="http://eng.localytics.com">Localytics</a>
+ - Batch, real-time, and predictive analytics driving our mobile app analytics and marketing
+ automation product.
+ - Components used: Spark, Spark Streaming, MLLib.
+- <a href="http://magine.com">Magine TV</a>
+- <a href="http://mediacrossing.com">MediaCrossing</a> – Digital Media Trading Experts in the
+New York and Boston areas
+ - We are using Spark as a drop-in replacement for Hadoop Map/Reduce to get the right answer
+ to our queries in a much shorter amount of time.
+- <a href="http://www.myfitnesspal.com/">MyFitnessPal</a>
+ - Using Spark to clean-up user entered food data using both explicit and implicit user signals
+ with the final goal of identifying high-quality food items.
+ - Using Spark to build different recommendation systems for recipes and foods.
+- <a href="http://deepspace.jpl.nasa.gov/">NASA JPL - Deep Space Network</a>
+- <a href="http://www.163.com/">Netease</a>
+- <a href="http://www.nflabs.com">NFLabs</a>
+- <a href="http://nsn.com">Nokia Solutions and Networks</a>
+- <a href="http://www.nttdata.com/global/en/">NTT DATA</a>
+- <a href="http://www.nubetech.co">Nube Technologies</a>
+ - Nube provides solutions for data curation at scale helping customer targeting, accurate
+ inventory and efficient analysis.
+- <a href="http://ooyala.com">Ooyala, Inc.</a> – Powering personalized video experiences
+across all screens
+ - See our blog post on how we use
+ <a href="http://engineering.ooyala.com/blog/fast-spark-queries-memory-datasets">Spark for
+ Fast Queries</a>
+ - See our presentation on
+ <a href="http://www.slideshare.net/EvanChan2/cassandra2013-spark-talk-final">Cassandra, Spark,
+ and Shark</a>
+- <a href="http://www.opentable.com/">Opentable</a>
+ - Using Apache Spark for log processing and ETL. The data obtained feeds the recommender
+ system powered by Spark MLLIB Matrix Factorization. We are evaluating the use of Spark
+ Streaming for real-time analytics.
+- <a href="http://pantera.io">PanTera</a>
+ - PanTera is a tool for exploring large datasets. It uses Spark to create XY and geographic
+ scatterplots from millions to billions of datapoints.
+ - Components we are using: Spark Core (Scala API), Spark SQL, and GraphX
+- <a href="http://www.peerialism.com">Peerialism</a>
+- <a href="http://www.planbmedia.com">PlanBMedia</a>
+- <a href="http://prediction.io/">PredicitionIo</a> - PredictionIO currently offers two engine
+templates for Apache Spark MLlib for recommendation (MLlib ALS) and classification (MLlib Naive
+Bayes). With these templates, you can create a custom predictive engine for production deployment
+efficiently.
+- <a href="http://premise.com">Premise</a>
+- <a href="http://www.quantifind.com">Quantifind</a>
+- <a href="http://radius.com">Radius Intelligence</a>
+ - Using Scala, Spark and MLLib for Radius Marketing and Sales intelligence platform including
+ data aggregation, data processing, data clustering, data analysis and predictive modeling of all
+ US businesses.
+- <a href="http://www.realimpactanalytics.com/">Real Impact Analytics</a>
+ - Building large scale analytics platforms for telecoms operators
+- <a href="http://rocketfuel.com/">RocketFuel</a>
+- <a href="http://www.rondhuit.com/">RONDHUIT</a>
+ - Machine Learning with Apache Mahout and Spark
+ <a href="http://www.rondhuit.com/services/training/mahout-ML.html">http://www.rondhuit.com/services/training/mahout-ML.html</a>
+- <a href="http://www.sailthru.com/">Sailthru</a>
+ - Uses Spark to build predictive models and recommendation systems for marketing automation
+ and personalization.
+- <a href="http://www.sisa.samsung.com/">Samsung Research America</a>
+- <a href="http://www.shopify.com/">Shopify</a>
+- <a href="http://www.simba.com/">Simba Technologies</a>
+ - BI/reporting/ETL for Spark and beyond
+- <a href="http://www.sinnia.com">Sinnia</a>
+- <a href="http://www.sktelecom.com/en/main/index.do">SK Telecom</a>
+ - SK Telecom analyses mobile usage patterns of customer with Spark and Shark.
+- <a href="http://socialmetrix.com/">Socialmetrix</a>
+- <a href="http://www.sohu.com">Sohu</a>
+- <a href="http://www.stratio.com/">Stratio</a>
+ - Offers an open-source Big Data platform centered around Apache Spark.
+- <a href="https://www.taboola.com/">Taboola</a> – Powering 'Content You May Like' around the web
+- <a href="http://www.techbase.com.tr">Techbase</a>
+- <a href="http://tencent.com/">Tencent</a>
+- <a href="http://www.tetraconcepts.com/">Tetra Concepts</a>
+- <a href="http://www.trendmicro.com/us/index.html">TrendMicro</a>
+- <a href="http://engineering.tripadvisor.com/using-apache-spark-for-massively-parallel-nlp/">TripAdvisor</a>
+- <a href="http://truedash.io">truedash</a>
+ - Automatic pulling of all your data in to Spark for enterprise visualisation, predictive
+ analytics and data exploration at a low cost.
+- <a href="http://www.trueffect.com">TruEffect Inc</a>
+- <a href="http://www.tuplejump.com">Tuplejump</a>
+ - Software development partners for Apache Spark and Cassandra projects
+- <a href="http://www.ucsc.edu">UC Santa Cruz</a>
+- <a href="http://missouri.edu/">University of Missouri Data Analytics and Discover Lab</a>
+- <a href="http://videoamp.com/">VideoAmp</a>
+ - Intelligent video ads for online and television viewing audiences.
+- <a href="http://www.vistarmedia.com">Vistar Media</a>
+ - Location technology company enabling brands to reach on-the-go consumers
+- <a href="http://www.yahoo.com">Yahoo!</a>
+- <a href="http://www.yandex.com">Yandex</a>
+ - Using Spark in
+ <a href="http://www.searchenginejournal.com/yandex-islands-markup-issues-implementation/71891/">Yandex Islands</a>,
+ to process islands identified from a search robor
+- <a href="http://www.zaloni.com/products/">Zaloni</a>
+ - Zaloni's data lake management platform (Bedrock) and self-service data preparation solution
+ (Mica) leverage Spark for fast execution of transformations and data exploration.
+ \ No newline at end of file