厦大数据库实验室博客 ·

PySpark 读写Hive数据源

💡 原文中文，约4600字，阅读约需11分钟。

📝

内容提要

本文介绍了在Windows下配置Spark访问Hive的步骤，以及使用SparkSession和HiveContext读写Hive数据的方法。同时提到了在IDE环境中配置Python开发环境的步骤。

🎯

🏷️

Meta暂停青少年与其AI角色聊天
Meta is "temporarily pausing" the ability for teens to chat with its ...
某二次元打灰游戏虚拟机检测绕过和nvme性能优化的libvirt配置
免责声明：我只是为了愉快的在自建的云游戏串流虚拟机上进行远程游戏，用虚拟机是因为All-in-boom宿主机还… 继续阅读某二次元打灰游戏虚拟机检测绕过和...
TikTok新所有者对你的信息流意味着什么
TikTok is officially under new ownership in the US, and that could spell big ...
宣布Databricks Delta Sharing对Iceberg格式的一级支持
With more than 300% year-on-year usage growth for 2 consecutive years, Delta ...
CNCF：Kubernetes是AI的‘基础’基础设施
The latest (CNCF) Annual Cloud Native Survey has been released, and with “82...
卡西欧推出了一款复古游戏风格的采样器
Casio showed up to NAMM (CES for music gear nerds) this year with a prototype...