Pyspark

See every discussion that mentions Pyspark

Brand Details

Type: Product

Description: PySpark is an open-source application programming interface (API) that allows Python programmers to interact with Apache Spark. Apache Spark is a powerful, widely used framework for large-scale data processing and machine learning. PySpark bridges the gap, letting users leverage Spark's capabilities using Python. It enables large-scale data processing, real-time data processing (Spark Streaming), machine learning (MLlib), structured data processing (Spark SQL and DataFrames), and integration with other tools like Jupyter Notebooks and Pandas.

Website: https://spark.apache.org/

Mention Analytics

Total Mentions: 13

Subreddit Mentions:

Positive Mentions: 10

Negative Mentions: 0

Sign up to filter, search, and track the sentiment of all brand mentions for Pyspark over time.

NeutralGeneral Discussion

💬 162 comments

⬆︎ 601 upvotes

Mentioned in context of an interview

Found in /r/SQL/
PositiveGeneral Discussion

💬 8 comments

⬆︎ 12 upvotes

Discusses the benefits of the language for certain tasks.

Found in /r/SQL/
NeutralGeneral Discussion

💬 10 comments

⬆︎ 0 upvotes

Mentioned as part of the tech stack.

Found in /r/SQL/

Mar 26, 2025

SQL interview prep

PositiveCareer Advice

💬 18 comments

⬆︎ 36 upvotes

User mentions the need to be familiar with this brand

Found in /r/SQL/

Mar 13, 2025

Pyspark like interface to postgres

PositiveSoftware Recommendation

💬 3 comments

⬆︎ 1 upvotes

The author states they have been using pyspark for 6 years and have grown accustomed to its interface and likes the select, col, groupBy, etc.

Found in /r/SQL/

Dec 12, 2024

Arguments against colleagues that say that SQL could be ‘terminated’

PositiveGeneral Discussion

💬 67 comments

⬆︎ 32 upvotes

Pyspark is mentioned as a tool used by modern data engineers, but not in a comparative context with SQL.

Found in /r/SQL/

Nov 26, 2024

Alternatives to SQL? Are there even any?

PositiveSoftware Recommendation

💬 50 comments

⬆︎ 6 upvotes

It is mentioned as an alternative to SQL.

Found in /r/SQL/

Jun 30, 2024

Much faster to COPY big datasets and manipulate in SQL vs using Pandas first

PositiveSoftware Recommendation

💬 22 comments

⬆︎ 48 upvotes

A commenter suggested Pyspark or NumPy as alternatives, but didn't provide enough information to determine if they would be faster.

Found in /r/SQL/

Jul 17, 2022

10 + Years of T-SQL time to learn Python?

PositiveCareer Advice

💬 57 comments

⬆︎ 84 upvotes

Mentioned positively in the context of Python for data analysis and ETL processes inside of Synapse Analytics or Databricks.

Found in /r/SQL/

Feb 18, 2022

Is SQL still has viable as it was several years ago?

NeutralCareer Advice

💬 84 comments

⬆︎ 64 upvotes

Mentioned as a data analytics tool

Found in /r/SQL/

Subscribe to our newsletter!

Stay up to date on how the latest changes in AI might impact your marketing plan.