Skip to content

Piyush Routray

  • Twitter
  • LinkedIn
  • About Piyush

Tag: Data Engineering

PySpark DataFrame ordering based on multiple columns

How to deal with the requirement to order a DataFrame using more than one column simultaneously? Also, consider some values … More

Data Engineering, Databricks, PySpark

How to encrypt files before sharing online?

As a Data Engineer, one faces the need to share files securely over the internet. An easy way of doing … More

Data Engineering, PGP, security

Optimized Import of Text Data for Analytics

Text data analysis is a staple use case for the Data Analytics world! There are multiple firms which enable capture … More

AWS Firehose, AWS S3, Data Engineering, Snowflake

Browse by Category

ambari AWS Firehose AWS S3 centOS 7 cloudera cron Databricks Data Engineering datanode DevOps docker hadoop HDP hive Hortonworks java jenkins kubernetes linux OpenJDK OracleJDK PGP PySpark Python s3cmd security Snowflake stackoverflow Windows

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 433 other subscribers
My Tweets
Create a website or blog at WordPress.com
  • Follow Following
    • Piyush Routray
    • Already have a WordPress.com account? Log in now.
    • Piyush Routray
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar