Introduction

Alien Giraffe integrates with your existing data sources and provides a unified interface for accessing data across your organization. This section covers how to connect and configure each supported data source.

Connect to PostgreSQL databases securely. The PostgreSQL guide covers:

  • Creating read-only database users
  • Configuring column-level access controls
  • Setting up data masking for sensitive information
  • Best practices for production deployments
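As a rough illustration of the first two items, the sketch below generates the standard PostgreSQL statements for a read-only role, optionally restricted to specific columns. The function name `read_only_role_sql` and the role, database, and table names are hypothetical and not part of Alien Giraffe; the guide itself remains the reference for the exact setup.

```python
import re

# Only plain lowercase identifiers are accepted, because they are
# interpolated into SQL text without quoting.
_IDENTIFIER = re.compile(r"^[a-z_][a-z0-9_]*$")


def _check(name):
    if not _IDENTIFIER.match(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return name


def read_only_role_sql(role, database, schema, column_grants=None):
    """Return SQL statements that set up a read-only PostgreSQL role.

    column_grants maps table name -> list of readable columns. When it is
    omitted, the role may read every existing table in the schema.
    The password is deliberately not set here; assign it separately.
    """
    role, database, schema = _check(role), _check(database), _check(schema)
    statements = [
        f"CREATE ROLE {role} LOGIN;",
        f"GRANT CONNECT ON DATABASE {database} TO {role};",
        f"GRANT USAGE ON SCHEMA {schema} TO {role};",
    ]
    if column_grants:
        # Column-level access: expose only the listed columns of each table.
        for table, columns in column_grants.items():
            cols = ", ".join(_check(c) for c in columns)
            statements.append(
                f"GRANT SELECT ({cols}) ON {schema}.{_check(table)} TO {role};"
            )
    else:
        # Table-level access: read every existing table in the schema.
        statements.append(
            f"GRANT SELECT ON ALL TABLES IN SCHEMA {schema} TO {role};"
        )
    return statements


if __name__ == "__main__":
    for line in read_only_role_sql(
        "ag_reader", "analytics", "public", {"customers": ["id", "country"]}
    ):
        print(line)
```

Because the role receives only `SELECT`, and only on the listed columns, sensitive columns such as email addresses stay unreadable even if the credentials leak.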

Access data stored in S3 buckets:

  • Support for Parquet, CSV, and JSON formats
  • Partitioned dataset handling
  • Cost-optimized query execution
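To make partitioned dataset handling concrete, here is a minimal sketch that reads partition values and the file format from an S3 object key, then skips objects outside the requested partitions. It assumes Hive-style `key=value` path segments, which is a common layout but not one this page confirms; `describe_s3_object` and `prune` are hypothetical helpers, not Alien Giraffe functions.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# The three formats listed above, keyed by file extension.
SUPPORTED_FORMATS = {".parquet": "parquet", ".csv": "csv", ".json": "json"}


def describe_s3_object(uri):
    """Split an s3:// URI into bucket, partition values, and file format."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3":
        raise ValueError(f"not an S3 URI: {uri!r}")
    segments = parsed.path.lstrip("/").split("/")
    partitions = {}
    for segment in segments[:-1]:
        # Hive-style partition directories look like "year=2024".
        if "=" in segment:
            key, _, value = segment.partition("=")
            partitions[key] = value
    suffix = PurePosixPath(segments[-1]).suffix.lower()
    return {
        "bucket": parsed.netloc,
        "partitions": partitions,
        "format": SUPPORTED_FORMATS.get(suffix),  # None if unsupported
    }


def prune(uris, **wanted):
    """Keep only objects whose partition values match every wanted key."""
    kept = []
    for uri in uris:
        partitions = describe_s3_object(uri)["partitions"]
        if all(partitions.get(k) == v for k, v in wanted.items()):
            kept.append(uri)
    return kept
```

Filtering on partition values before reading any object is one way a query can avoid scanning, and paying for, data it does not need.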

All data sources in Alien Giraffe follow a consistent configuration pattern:

  1. Connection Details - Endpoint, credentials, and authentication
  2. Security Settings - Access controls and encryption
  3. Performance Tuning - Connection pooling and query optimization
  4. Schema Mapping - How your data maps to Alien Giraffe schemas
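The four parts of this pattern can be pictured as one nested structure. The sketch below uses plain Python dataclasses; every class and field name is hypothetical and chosen only to mirror the list above, so the real configuration format may look different.

```python
from dataclasses import dataclass, field


@dataclass
class ConnectionDetails:
    endpoint: str
    username: str
    password_env_var: str  # name of the environment variable holding the secret


@dataclass
class SecuritySettings:
    read_only: bool = True
    require_tls: bool = True


@dataclass
class PerformanceTuning:
    pool_size: int = 5              # assumed default, not a documented value
    query_timeout_seconds: int = 30  # assumed default, not a documented value


@dataclass
class DataSourceConfig:
    name: str
    connection: ConnectionDetails
    security: SecuritySettings = field(default_factory=SecuritySettings)
    performance: PerformanceTuning = field(default_factory=PerformanceTuning)
    # Maps source column names to schema field names.
    schema_mapping: dict = field(default_factory=dict)


orders = DataSourceConfig(
    name="orders_db",
    connection=ConnectionDetails(
        endpoint="db.example.internal:5432",
        username="ag_reader",
        password_env_var="ORDERS_DB_PASSWORD",
    ),
    schema_mapping={"cust_id": "customer_id"},
)
```

Keeping the secret out of the structure, and storing only the name of the variable that holds it, means the configuration can be committed to version control without exposing credentials.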

Regardless of the data source, follow these security practices:

  • Use Read-Only Access - Create dedicated read-only users or roles
  • Implement Least Privilege - Grant access only to required tables/datasets
  • Enable Encryption - Use SSL/TLS for all connections
  • Rotate Credentials - Regularly update access keys and passwords
  • Audit Access - Monitor and log all data access patterns
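These practices lend themselves to an automated check. The following sketch inspects a plain dictionary describing a data source and reports which practices it violates. The dictionary keys, the `audit_source` function, and the 90-day rotation window are all assumptions made for illustration, not Alien Giraffe settings.

```python
from datetime import date, timedelta

# Assumed rotation policy; pick whatever your organization requires.
MAX_CREDENTIAL_AGE = timedelta(days=90)


def audit_source(source, today):
    """Return a list of violated practices for one data source description."""
    findings = []
    if not source.get("read_only", False):
        findings.append("connection is not read-only")
    tables = source.get("allowed_tables")
    if not tables or tables == "*":
        findings.append("access is not limited to specific tables")
    if not source.get("tls", False):
        findings.append("connection is not encrypted with SSL/TLS")
    rotated = source.get("credentials_rotated_on")
    if rotated is None or today - rotated > MAX_CREDENTIAL_AGE:
        findings.append("credentials are overdue for rotation")
    if not source.get("access_logging", False):
        findings.append("data access is not logged")
    return findings


compliant = {
    "read_only": True,
    "allowed_tables": ["orders", "customers"],
    "tls": True,
    "credentials_rotated_on": date(2024, 5, 1),
    "access_logging": True,
}
print(audit_source(compliant, today=date(2024, 6, 1)))
```

Running a check like this in continuous integration catches a source that drifts away from the practices, for example one whose credentials have quietly aged past the rotation window.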

To get started:

  1. Choose your primary data source from the guides above
  2. Follow the step-by-step configuration instructions
  3. Define schemas for your data
  4. Start querying with the Alien Giraffe Python API