Clean and ready-to-use dataset of RUSSELL 1000 equities, fully compatible with yfinance, enriched with market data and ICB classification.
The file russell1000_full_enriched.csv contains the following columns:
Ticker: base tickerCompany: company namemarket_cap: market capitalization (when available)
GICSSector: sector classificationGICS Sub-Industry: detailed subsector classification
import pandas as pd
import yfinance as yf
df = pd.read_csv("russell1000_full_enriched.csv")
tickers = df["Ticker"].dropna().unique().tolist()
data = yf.download(tickers[:10], period="1y", interval="1d")Stock screening (by sector, industry, market cap, etc.) Factor research and quantitative analysis Backtesting strategies on Russell1000 equities Portfolio construction & diversification studies Financial data analysis in Python
Scraped official Russell1000 listings Mapped tickers to Yahoo Finance format Validated tickers using yfinance price availability Enriched dataset with: Market capitalization GICS classification (industry → subsector) Company names and ISIN mapping
Some fields (notably market_cap) may be missing or partially outdated due to Yahoo Finance limitations All tickers are validated against live data availability in yfinance Dataset is focused on Russell1000 listings only
Python 3.x pandas yfinance
pip install pandas yfinanceThis dataset is provided for informational and research purposes only. It does not constitute financial or investment advice.