Problem: Given an employees table with (emp_id, name, dept, salary),
return all employees with salary > 50000, sorted by salary desc.
python — editable
from pyspark.sql.functions import col
employees = spark.read.parquet("employees/")
result = employees \
.filter(col("salary") > 50000) \
.select("emp_id", "name", "dept", "salary") \
.orderBy(col("salary").desc())
result.show()