---
layout: global
title: Powered By Spark
type: "page singular"
navigation:
  weight: 5
  show: true
---

<h2>Project and Product names using "Spark"</h2>

Organizations creating products and projects for use with Apache Spark, along with associated 
marketing materials, should take care to respect the trademark in "Apache Spark" and its logo. 
Please refer to <a href="http://www.apache.org/foundation/marks/">ASF Trademarks Guidance</a> and 
associated <a href="http://www.apache.org/foundation/marks/faq/">FAQ</a> 
for comprehensive and authoritative guidance on proper usage of ASF trademarks.

Names that do not include "Spark" at all have no potential trademark issue with the Spark project. 
This is recommended.

Names like "Spark BigCoProduct" are not OK, nor are names including "Spark" in general. 
The above links, however, describe some exceptions, such as names like "BigCoProduct, 
powered by Apache Spark" or "BigCoProduct for Apache Spark".

It is common practice to create software identifiers (Maven coordinates, module names, etc.) 
like "spark-foo". These are permitted. Nominative use of trademarks in descriptions is also 
always allowed, as in "BigCoProduct is a widget for Apache Spark".

<h2>Companies and Organizations</h2>

To add yourself to the list, please email `dev@spark.apache.org` with your organization name, URL, 
a list of which Spark components you are using, and a short description of your use case.

- <a href="http://amplab.cs.berkeley.edu">UC Berkeley AMPLab</a> - Big data research lab that 
initially launched Spark
  - We're building a variety of open source projects on Spark
  - We have both graduate students and a team of professional software engineers working on the stack
- <a href="http://4quant.com">4Quant</a>
- <a href="http://www.actnowib.com">Act Now</a>
  - Spark powers NOW APPS, a big data, real-time, predictive analytics platform. We use Spark SQL, 
  MLlib and GraphX components for both batch ETL and analytics applied to telecommunication data, 
  providing faster and more meaningful insights and actionable data to the operators.
- <a href="http://adatao.com">Adatao, Inc.</a> - Data Intelligence for All
  - Visual, Real-Time, Predictive Analytics on Spark+Hadoop, with built-in support for R, Python, 
  SQL, and Natural Language.
  - Team of ex-Googlers and Yahoos with large-scale infrastructure experience 
  (including both flavors of MapReduce at Google and Yahoo) and PhDs in ML/Data Mining
  - Determined that Spark, among the many alternatives, answered the right problem statements with 
  the right design
- <a href="http://www.agilelab.it">Agile Lab</a>
  - enhancing big data. 360 customer view, log analysis, BI
- <a href="http://www.taobao.com/">Alibaba Taobao</a>
  - We built one of the world's first Spark on YARN production clusters.
  - See our blog posts (in Chinese) about Spark at Taobao: 
  <a href="http://rdc.taobao.org/?tag=spark">http://rdc.taobao.org/?tag=spark</a>
- <a href="http://alpinenow.com/">Alpine Data Labs</a>
- <a href="http://amazon.com">Amazon</a>
- <a href="http://www.amrita.edu/cyber/">Amrita Center for Cyber Security Systems and Networks</a>
- <a href="http://www.art.com/">Art.com</a>
  - Trending analytics and personalization
- <a href="http://www.asiainfo.com">AsiaInfo</a>
  - We are using Spark Core, Streaming, MLlib and GraphX. We leverage the Spark and Hadoop 
  ecosystem to build cost-effective data center solutions for our customers in the telco 
  industry as well as other industrial sectors.
- <a href="http://www.atigeo.com">Atigeo</a> – integrated Spark in xPatterns, our big data 
analytics platform, as a replacement for Hadoop MR
- <a href="https://atp.io">atp</a>
  - Predictive models and learning algorithms to improve the relevance of programmatic marketing.
  - Components used: Spark SQL, MLlib.
- <a href="http://www.autodesk.com">Autodesk</a>
- <a href="http://www.baidu.com">Baidu</a>
- <a href="http://www.bakdata.com/">Bakdata</a> – using Spark (and Shark) to perform interactive 
exploration of large datasets
- <a href="http://www.bigindustries.be/">Big Industries</a> - using Spark Streaming: The 
Big Content Platform is a business-to-business content asset management service providing a 
searchable, aggregated source of live news feeds, public domain media and archives of content.
- <a href="http://www.bizo.com">Bizo</a>
  - Check out our talk on <a href="http://www.meetup.com/spark-users/events/139804022/">Spark at Bizo</a> 
  at the Spark user meetup
- <a href="http://www.celtra.com">Celtra</a>
- <a href="http://www.clearstorydata.com">ClearStory Data</a> – ClearStory's platform and 
integrated Data Intelligence application leverages Spark to speed analysis across internal 
and external data sources, driving holistic and actionable insights.
- <a href="https://www.concur.com">Concur</a>
  - Spark SQL, MLlib
  - Using Spark for travel and expenses analytics and personalization
- <a href="http://www.contentsquare.com">Content Square</a>
  - We use Spark to regularly read raw data, convert them into Parquet, and process them to 
  create advanced analytics dashboards: aggregation, sampling, statistics computations, 
  anomaly detection, machine learning.
- <a href="http://www.conviva.com">Conviva</a> – Experience Live
  - See our talk at <a href="http://ampcamp.berkeley.edu/3/">AmpCamp</a> on how we are 
  <a href="http://www.youtube.com/watch?feature=player_detailpage&v=YaayAatdRNs">using Spark to 
  provide real time video optimization</a>
- <a href="https://www.creditkarma.com/">Credit Karma</a>
  - We create personalized experiences using Spark.
- <a href="http://databricks.com">Databricks</a>
  - Formed by the creators of Apache Spark and Shark, Databricks is working to greatly expand these 
  open source projects and transform big data analysis in the process. We're deeply committed to 
  keeping all work on these systems open source.
  - We provide a hosted service to run Spark, 
  <a href="http://www.databricks.com/cloud">Databricks Cloud</a>, and partner to 
  <a href="http://databricks.com/support/">support Apache Spark</a> with other Hadoop and big 
  data companies.
- <a href="http://dianping.com">Dianping.com</a>
- <a href="http://www.digby.com">Digby</a>
- <a href="http://www.drawbrid.ge/">Drawbridge</a>
- <a href="http://www.ebay.com/">eBay Inc.</a>
  - Using Spark core for log transaction aggregation and analytics
- <a href="http://labs.elsevier.com">Elsevier Labs</a>
  - Use Case: Building Machine Reading Pipeline, Knowledge Graphs, Content as a Service, Content 
  and Event Analytics, Content/Event based Predictive Models and Big Data Processing.
  - We use Scala and Python over Databricks Notebooks for most of our work.
- <a href="http://www.eurecom.fr/en">EURECOM</a>
- <a href="http://www.exabeam.com">Exabeam</a>
- <a href="http://www.faimdata.com/">Faimdata</a>
  - Build eCommerce and data intelligence solutions for the retail industry on top of 
  Spark/Shark/Spark Streaming
- <a href="http://falkonry.com">Falkonry</a>
- <a href="http://www.flytxt.com">Flytxt</a>
  - Big Data analytics for subscriber profiling and personalization in the telecommunications 
  domain. We are using Spark Core and MLlib.
- <a href="http://www.jeremyfreeman.net">Freeman Lab at HHMI</a>
  - We are using Spark for analyzing and visualizing patterns in large-scale recordings of brain 
  activity in real time
- <a href="http://www.fundacionctic.org">Fundacion CTIC</a>
- <a href="http://graphflow.com">GraphFlow, Inc.</a>
- <a href="http://www.groupon.com/app/subscriptions/new_zip?division_p=san-francisco">Groupon</a>
- <a href="http://www.guavus.com/">Guavus</a>
  - Stream processing of network machine data
- <a href="http://www.hitachi-solutions.com/">Hitachi Solutions</a>
- <a href="http://hivedata.com/">The Hive</a>
- <a href="http://www.research.ibm.com/labs/almaden/index.shtml">IBM Almaden</a>
- <a href="http://www.infoobjects.com">InfoObjects</a>
  - Award winning Big Data consulting company with focus on Spark and Hadoop
- <a href="http://en.inspur.com">Inspur</a>
- <a href="http://www.sehir.edu.tr/en/">Istanbul Sehir University</a>
- <a href="http://www.kenshoo.com/">Kenshoo</a>
  - Digital marketing solutions and predictive media optimization
- <a href="http://www.kelkoo.co.uk">Kelkoo</a>
  - Using Spark Core, SQL, and Streaming. Product recommendations, BI and analytics, 
  real-time malicious activity filtering, and data mining.
- <a href="http://www.knoldus.com">Knoldus Software LLC</a>
- <a href="http://eng.localytics.com">Localytics</a>
  - Batch, real-time, and predictive analytics driving our mobile app analytics and marketing 
  automation product.
  - Components used: Spark, Spark Streaming, MLlib.
- <a href="http://magine.com">Magine TV</a>
- <a href="http://mediacrossing.com">MediaCrossing</a> – Digital Media Trading Experts in the 
New York and Boston areas
  - We are using Spark as a drop-in replacement for Hadoop Map/Reduce to get the right answer 
  to our queries in a much shorter amount of time.
- <a href="http://www.myfitnesspal.com/">MyFitnessPal</a>
  - Using Spark to clean up user-entered food data using both explicit and implicit user signals 
  with the final goal of identifying high-quality food items.
  - Using Spark to build different recommendation systems for recipes and foods.
- <a href="http://deepspace.jpl.nasa.gov/">NASA JPL - Deep Space Network</a>
- <a href="http://www.163.com/">Netease</a>
- <a href="http://www.nflabs.com">NFLabs</a>
- <a href="http://nsn.com">Nokia Solutions and Networks</a>
- <a href="http://www.nttdata.com/global/en/">NTT DATA</a>
- <a href="http://www.nubetech.co">Nube Technologies</a>
  - Nube provides solutions for data curation at scale helping customer targeting, accurate 
  inventory and efficient analysis.
- <a href="http://ooyala.com">Ooyala, Inc.</a> – Powering personalized video experiences 
across all screens
  - See our blog post on how we use 
  <a href="http://engineering.ooyala.com/blog/fast-spark-queries-memory-datasets">Spark for 
  Fast Queries</a>
  - See our presentation on 
  <a href="http://www.slideshare.net/EvanChan2/cassandra2013-spark-talk-final">Cassandra, Spark, 
  and Shark</a>
- <a href="http://www.opentable.com/">Opentable</a>
  - Using Apache Spark for log processing and ETL. The data obtained feeds the recommender 
  system powered by Spark MLlib Matrix Factorization. We are evaluating the use of Spark 
  Streaming for real-time analytics.
- <a href="http://pantera.io">PanTera</a>
  - PanTera is a tool for exploring large datasets. It uses Spark to create XY and geographic 
  scatterplots from millions to billions of datapoints.
  - Components we are using: Spark Core (Scala API), Spark SQL, and GraphX
- <a href="http://www.peerialism.com">Peerialism</a>
- <a href="http://www.planbmedia.com">PlanBMedia</a>
- <a href="http://prediction.io/">PredictionIO</a> - PredictionIO currently offers two engine 
templates for Apache Spark MLlib for recommendation (MLlib ALS) and classification (MLlib Naive 
Bayes). With these templates, you can create a custom predictive engine for production deployment 
efficiently.
- <a href="http://premise.com">Premise</a>
- <a href="http://www.quantifind.com">Quantifind</a>
- <a href="http://radius.com">Radius Intelligence</a>
  - Using Scala, Spark and MLlib for the Radius Marketing and Sales intelligence platform, including 
  data aggregation, data processing, data clustering, data analysis and predictive modeling of all 
  US businesses.
- <a href="http://www.realimpactanalytics.com/">Real Impact Analytics</a>
  - Building large scale analytics platforms for telecoms operators
- <a href="http://rocketfuel.com/">RocketFuel</a>
- <a href="http://www.rondhuit.com/">RONDHUIT</a>
  - Machine Learning with Apache Mahout and Spark 
  <a href="http://www.rondhuit.com/services/training/mahout-ML.html">http://www.rondhuit.com/services/training/mahout-ML.html</a>
- <a href="http://www.sailthru.com/">Sailthru</a>
  - Uses Spark to build predictive models and recommendation systems for marketing automation 
  and personalization.
- <a href="http://www.sisa.samsung.com/">Samsung Research America</a>
- <a href="http://www.shopify.com/">Shopify</a>
- <a href="http://www.simba.com/">Simba Technologies</a>
  - BI/reporting/ETL for Spark and beyond
- <a href="http://www.sinnia.com">Sinnia</a>
- <a href="http://www.sktelecom.com/en/main/index.do">SK Telecom</a>
  - SK Telecom analyses mobile usage patterns of customers with Spark and Shark.
- <a href="http://socialmetrix.com/">Socialmetrix</a>
- <a href="http://www.sohu.com">Sohu</a>
- <a href="http://www.stratio.com/">Stratio</a>
  - Offers an open-source Big Data platform centered around Apache Spark.
- <a href="https://www.taboola.com/">Taboola</a> – Powering 'Content You May Like' around the web
- <a href="http://www.techbase.com.tr">Techbase</a>
- <a href="http://tencent.com/">Tencent</a>
- <a href="http://www.tetraconcepts.com/">Tetra Concepts</a>
- <a href="http://www.trendmicro.com/us/index.html">TrendMicro</a>
- <a href="http://engineering.tripadvisor.com/using-apache-spark-for-massively-parallel-nlp/">TripAdvisor</a>
- <a href="http://truedash.io">truedash</a>
  - Automatic pulling of all your data into Spark for enterprise visualisation, predictive 
  analytics and data exploration at a low cost.
- <a href="http://www.trueffect.com">TruEffect Inc</a>
- <a href="http://www.tuplejump.com">Tuplejump</a>
  - Software development partners for Apache Spark and Cassandra projects
- <a href="http://www.ucsc.edu">UC Santa Cruz</a>
- <a href="http://missouri.edu/">University of Missouri Data Analytics and Discover Lab</a>
- <a href="http://videoamp.com/">VideoAmp</a>
  - Intelligent video ads for online and television viewing audiences.
- <a href="http://www.vistarmedia.com">Vistar Media</a>
  - Location technology company enabling brands to reach on-the-go consumers
- <a href="http://www.yahoo.com">Yahoo!</a>
- <a href="http://www.yandex.com">Yandex</a>
  - Using Spark in 
  <a href="http://www.searchenginejournal.com/yandex-islands-markup-issues-implementation/71891/">Yandex Islands</a>, 
  to process islands identified from a search robot
- <a href="http://www.zaloni.com/products/">Zaloni</a>
  - Zaloni's data lake management platform (Bedrock) and self-service data preparation solution 
  (Mica) leverage Spark for fast execution of transformations and data exploration.