Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Add the 13f data directly from the FMP #6956

Open
wants to merge 24 commits into
base: develop
Choose a base branch
from

Conversation

joshuaBri
Copy link
Contributor

Why?:

Expand the 13f data directly from the FMP
What?:

Adds fmp as a provider to, obb.equity.ownership
Impact:

Get Data of 13FHR from fmp provider

@CLAassistant
Copy link

CLAassistant commented Nov 22, 2024

CLA assistant check
All committers have signed the CLA.

@joshuaBri joshuaBri changed the title Feature/13f [Feature] Add the 13f data directly from the FMP Nov 22, 2024
Copy link
Contributor

@deeleeramone deeleeramone left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi, thanks for the PR. It's off to a nice start, let's bring it home!

There are some FMP-related issues that become ours' without getting out ahead of them.

  • For this function, the symbol input needs to accept both a ticker symbol and a CIK number. There needs to be a translation/check for this.
    • obb.equity.ownership.form_13f(symbol="0001045810", provider="fmp")
    • obb.equity.ownership.form_13f(symbol="NVDA", provider="fmp")
  • Getting the most recent filing shouldn't require a date, and when a date is entered, it needs to find the appropriate one - i.e, a successful request shouldn't hinge on knowing the exact right date.
  • We want to be able to get more than one at a time (from the same CIK, not so much multiple CIKs) so that the changes can be easily observed. The "limit" param can be a convenience method for the user to fetch the most recent N filing.

"asset_class": "titleOfClass",
"filling_date": "fillingDate",
"accepted_date": "acceptedDate",
"ticker_cusip": "tickercusip",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"ticker_cusip" -> "symbol"

"ticker_cusip": "tickercusip",
"final_link": "finalLink",
}
filling_date: dateType = Field(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where fields have default=None, they need to be annotated as, Optional[type]

query: FMPForm13FHRQueryParams, data: List[Dict], **kwargs: Any
) -> List[FMPForm13FHRData]:
"""Return the transformed data."""
return [FMPForm13FHRData(**d) for d in data]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FMP is returning an additional field, "cik", which we do not want because it is the filer's CIK and having this value makes things both confusing and redundant. Please pop that value before returning the response.

query,
exclude=["symbol", "limit"],
)
result = await amake_request(url, **kwargs)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here, we can handle API authorization errors if we pass a specific response_callback helper function to amake_request

from openbb_fmp.utils.helpers import response_callback
result = await amake_request(url, response_callback=response_callback, **kwargs)

Obviously, you wouldn't be likely to know this, but it will ensure that authorization error messages get properly communicated to the user.

if result:
results.extend(result)

await asyncio.gather(*[get_one(symbol) for symbol in symbols])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As noted already, we are more interested in getting multiple filings from the same symbol, rather than the same date for multiple symbols. If we only allow 1 symbol, but multiple dates, we'll need to handle each date for the given symbol.



@pytest.mark.record_http
def test_fmp_form_13f_fetcher(credentials=test_credentials):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In, openbb_platform/extensions/equity/integration, there are two test files for the integration tests, API and Python. A full set of parameters (can be None if the param is optional) with provider='fmp' needs to be added to the params of the endpoint test. Insert them below the "sec" provider params.

@joshuaBri
Copy link
Contributor Author

joshuaBri commented Nov 25, 2024

Thanks a lot for your correction.But I still have some troubles:
I can't see through how provider "sec" fix the problem with ticker-to-cik map,or it can fetch the data directly through ticker.But FMP can't fetch the data directly through ticker. I have to get the cik first.
I can fetch the related data through the url --https://www.sec.gov/files/company_tickers.json
Can I use it in coding ?
image

@deeleeramone
Copy link
Contributor

Thanks a lot for your correction.But I still have some troubles: I can't see through how provider "sec" fix the problem with ticker-to-cik map,or it can fetch the data directly through ticker.But FMP can't fetch the data directly through ticker. I have to get the cik first. I can fetch the related data through the url --https://www.sec.gov/files/company_tickers.json Can I use it in coding?

No, you'll need to use FMP endpoints for FMP data - https://site.financialmodelingprep.com/developer/docs/cik-search-company-search

@joshuaBri
Copy link
Contributor Author

joshuaBri commented Nov 26, 2024

Thanks a lot for your correction.But I still have some troubles: I can't see through how provider "sec" fix the problem with ticker-to-cik map,or it can fetch the data directly through ticker.But FMP can't fetch the data directly through ticker. I have to get the cik first. I can fetch the related data through the url --https://www.sec.gov/files/company_tickers.json Can I use it in coding?

No, you'll need to use FMP endpoints for FMP data - https://site.financialmodelingprep.com/developer/docs/cik-search-company-search
Sorry, I don't think you get my point. For example , I want to get the cik of ticker "NVDA". Where can I get the cik for this ticker?
The endpoint you mentioned can't help me to get the cik. If i don't have cik,I can't use the FMP endpoints to fetch 13F data.

@deeleeramone
Copy link
Contributor

Sorry, I don't think you get my point. For example , I want to get the cik of ticker "NVDA". Where can I get the cik for this ticker?

I understand, personally I find their documentation and labeling to be confusing, perhaps this one is better.

https://site.financialmodelingprep.com/developer/docs/institutional-holders-search-api

@joshuaBri
Copy link
Contributor Author

joshuaBri commented Nov 26, 2024

Sorry, I don't think you get my point. For example , I want to get the cik of ticker "NVDA". Where can I get the cik for this ticker?

I understand, personally I find their documentation and labeling to be confusing, perhaps this one is better.

https://site.financialmodelingprep.com/developer/docs/institutional-holders-search-api

Yeah, I recently tried this endpoint, but it’s quite confusing. According to their documentation, you can 'search for institutional investment managers by name, ticker symbol, or CUSIP number.' However, I couldn’t get results for some of the most common symbols like 'AAPL' or 'NVDA.'
I already sent email to ask FMP about this endpoint.

@deeleeramone
Copy link
Contributor

Yeah, I recently tried this endpoint, but it’s quite confusing. According to their documentation, you can 'search for institutional investment managers by name, ticker symbol, or CUSIP number.' However, I couldn’t get results for some of the most common symbols like 'AAPL' or 'NVDA.' I already sent email to ask FMP about this endpoint.

OK, let's see what they have to say about it, we'll circle back to handling that one thing when we get an answer. Worst case scenario will be to use a field_validator and reject input to FMP that is not numeric.

@joshuaBri
Copy link
Contributor Author

FMP replied me . https://financialmodelingprep.com/api/v4/institutional-ownership/symbol-ownership?symbol=AAPL&includeCurrentQuarter=false
This endpoint could work out . And I already corrected my code in the latest commit. Please check it when you are available.

@deeleeramone
Copy link
Contributor

deeleeramone commented Nov 27, 2024

FMP replied me . https://financialmodelingprep.com/api/v4/institutional-ownership/symbol-ownership?symbol=AAPL&includeCurrentQuarter=false This endpoint could work out . And I already corrected my code in the latest commit. Please check it when you are available.

This one is not the same thing; this aggregates institutional ownership of an entity, whereas Form 13F is an entity's ownership of another.

Also, it already exists here: https://docs.openbb.co/platform/reference/equity/ownership/institutional

@joshuaBri
Copy link
Contributor Author

FMP replied me . https://financialmodelingprep.com/api/v4/institutional-ownership/symbol-ownership?symbol=AAPL&includeCurrentQuarter=false This endpoint could work out . And I already corrected my code in the latest commit. Please check it when you are available.

This one is not the same thing; this aggregates institutional ownership of an entity, whereas Form 13F is an entity's ownership of another.

Also, it already exists here: https://docs.openbb.co/platform/reference/equity/ownership/institutional

I konw this endpoint aggregates other data.But that's exactly what FMP tell me. If I can't use this one to get cik ,what can I use to get it?
img_v3_02h1_598751eb-128f-4a1d-9864-559e6c5c81dg

@deeleeramone
Copy link
Contributor

deeleeramone commented Dec 5, 2024

I konw this endpoint aggregates other data.But that's exactly what FMP tell me. If I can't use this one to get cik ,what can I use to get it?

Something like this should work, @joshuaBri!

from openbb_core.app.model.abstract.error import OpenBBError
from openbb_fmp.models.equity_profile import FMPEquityProfileFetcher

api_key = credentials.get("fmp_api_key") if credentials else ""
if query.symbol.isnumeric() is False:
    try:
        profile = await FMPEquityProfileFetcher.fetch_data({"symbol": query.symbol}, {"fmp_api_key": api_key})
        cik = getattr(profile[0], "cik", "")
        if not cik:
            raise OpenBBError(f"Invalid symbol -> {query.symbol}")
    except OpenBBError as e:
        raise OpenBBError(f"Invalid symbol -> {query.symbol} -> {e}")
else:
    cik = query.symbol

The whole block, and the other imports only used within aextract_data can go right in the function.

@joshuaBri
Copy link
Contributor Author

I konw this endpoint aggregates other data.But that's exactly what FMP tell me. If I can't use this one to get cik ,what can I use to get it?

Something like this should work, @joshuaBri!

from openbb_core.app.model.abstract.error import OpenBBError
from openbb_fmp.models.equity_profile import FMPEquityProfileFetcher

api_key = credentials.get("fmp_api_key") if credentials else ""
if query.symbol.isnumeric() is False:
    try:
        profile = await FMPEquityProfileFetcher.fetch_data({"symbol": query.symbol}, {"fmp_api_key": api_key})
        cik = getattr(profile[0], "cik", "")
        if not cik:
            raise OpenBBError(f"Invalid symbol -> {query.symbol}")
    except OpenBBError as e:
        raise OpenBBError(f"Invalid symbol -> {query.symbol} -> {e}")
else:
    cik = query.symbol

The whole block, and the other imports only used within aextract_data can go right in the function.

Thanks,I changed my codes . Please check the latest commit.@deeleeramone

@joshuaBri
Copy link
Contributor Author

@deeleeramone
image
It seems I need to upload this yaml file,Please help me to generate it,thanks

if not dates:
raise EmptyDataError("No data returned for the given symbol.")
if not dates or len(dates) == 0:
warn(f"Symbol Error: No data found for symbol {cik}")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't need to warn here because we are raising in the statement above.

@joshuaBri
Copy link
Contributor Author

@deeleeramone Still need the yaml file,please chreck it again,thanks
image

@deeleeramone
Copy link
Contributor

deeleeramone commented Jan 1, 2025

Still need the yaml file,please chreck it again,thanks

The logic for "limit" and "date" do not appear to be working. The date(s) provided by the user need to be nearest corresponding valid date, and then it should return the entire contents for that date(s).

Screenshot 2024-12-31 at 5 07 40 PM

@joshuaBri
Copy link
Contributor Author

Got it.I changed it . But I still have some questions. For Example, if you run : obb.equity.ownership.form_13f(provider="fmp", symbol=="brk-a", date="2024-12-25", limit=100).to_df()
But fmp doesn't provide the data for "2024-12-31",then I should warn the date is not right . Am I right about this?
please rerun your check. Thanks.

@joshuaBri
Copy link
Contributor Author

joshuaBri commented Jan 10, 2025

Still need the yaml file,please chreck it again,thanks

The logic for "limit" and "date" do not appear to be working. The date(s) provided by the user need to be nearest corresponding valid date, and then it should return the entire contents for that date(s).

Screenshot 2024-12-31 at 5 07 40 PM

@deeleeramone pls have a look.I changed already

@deeleeramone
Copy link
Contributor

Got it.I changed it . But I still have some questions. For Example, if you run : obb.equity.ownership.form_13f(provider="fmp", symbol=="brk-a", date="2024-12-25", limit=100).to_df() But fmp doesn't provide the data for "2024-12-31",then I should warn the date is not right . Am I right about this? please rerun your check. Thanks.

It's not that "FMP doesn't provide the data for 2024-12-31", more so that the data does not exist. You could issue a warning if the date varies dramatically, but as this form is filed quarterly, delayed, it shouldn't really matter to user that the data does not match the exact date.

"limit" and "date" would not really be used in the same command. Here, "limit" means, "the most recent N number of reports". It does not mean, "the number of rows". There is not a scenario where limiting the number of rows returned from a report is actually desirable. The default state should be the latest report.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants